TRANSPORTERS AND ION CHANNELS

This invention relates to nucleic acid and amino acid sequences of transporters and ion
channels and to the use of these sequences in the diagnosis, treatment, and prevention of transport,
neurological, muscle, immunological, and cell proliferative disorders, and in the assessment of the
effects of exogenous compounds on the expression of nucleic acid and amino acid sequences of
transporters and ion channels.

BACKGROUND OF THE INVENTION 

Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by
hydrophobic lipid bilayer membranes which are highly impermeable to most polar molecules. Cells
and organelles require transport proteins to import and export essential nutrients and metal ions
including K+, NH4+, Pi, SO4 2-, sugars, and vitamins, as well as various metabolic waste products.
Transport proteins also play roles in antibiotic resistance, toxin secretion, ion balance, synaptic
neurotransmission, kidney function, intestinal absorption, tumor growth, and other diverse cell
functions (Griffith, J. and C. Sansom (1998) The Transporter Facts Book, Academic Press, San
Diego CA, pp. 3-29). Transport can occur by a passive concentration-dependent mechanism, or can
be linked to an energy source such as ATP hydrolysis or an ion gradient. Proteins that function in
transport include carrier proteins, which bind to a specific solute and undergo a conformational
change that translocates the bound solute across the membrane, and channel proteins, which form
hydrophilic pores that allow specific solutes to diffuse through the membrane down an
electrochemical solute gradient.

Carrier proteins which transport a single solute from one side of the membrane to the other
are called uniporters. In contrast, coupled transporters link the transfer of one solute with
simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in
the opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety
of symporter systems driven by the sodium gradient that exists across the plasma membrane.
Sodium moves into the cell down its electrochemical gradient and brings the solute into the cell
with it. The sodium gradient that provides the driving force for solute uptake is maintained by the
ubiquitous Na+/K+ ATPase system. Sodium-coupled transporters include the mammalian glucose
transporter (SGLT1), iodide transporter (NIS), and multivitamin transporter (SMVT). All three
transporters have twelve putative transmembrane segments, extracellular glycosylation sites, and
cytoplasmically-oriented N- and C-termini. NIS plays a crucial role in the evaluation, diagnosis,
and treatment of various thyroid pathologies because it is the molecular basis for radioiodide
thyroid-imaging techniques and for specific targeting of radioisotopes to the thyroid gland (Levy,
O. et al. (1997) Proc. Natl. Acad. Sci. USA 94:5568-5573). SMVT is expressed in the intestinal
mucosa, kidney, and placenta, and is implicated in the transport of the water-soluble vitamins, e.g.,
biotin and pantothenate (Prasad, P.D. et al. (1998) J. Biol. Chem. 273:7501-7506).
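
The uniport, symport, and antiport distinction described above can be illustrated with a minimal Python sketch. The function name `classify_carrier` and the "in"/"out" encoding are hypothetical conveniences introduced here for illustration; they are not part of the disclosure.

```python
from typing import Sequence


def classify_carrier(directions: Sequence[str]) -> str:
    """Classify a carrier by the direction of each solute it moves.

    `directions` holds one entry per transported solute, each either
    "in" or "out" (a hypothetical encoding used only in this sketch).
    """
    if not directions or any(d not in ("in", "out") for d in directions):
        raise ValueError('each direction must be "in" or "out"')
    if len(directions) == 1:
        return "uniport"  # a single solute crosses the membrane
    # Coupled transport: same direction is symport, opposite is antiport.
    return "symport" if len(set(directions)) == 1 else "antiport"


# Sodium-coupled glucose uptake: Na+ and glucose both enter the cell.
sodium_glucose = classify_carrier(["in", "in"])
# An exchanger moving one solute in and another out.
exchanger = classify_carrier(["in", "out"])
```

Under this encoding, a sodium-driven uptake system such as SGLT1 is classified as a symport, because both solutes move into the cell.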

One of the largest families of transporters is the major facilitator superfamily (MFS), also
called the uniporter-symporter-antiporter family. MFS transporters are single polypeptide carriers
that transport small solutes in response to ion gradients. Members of the MFS are found in all
classes of living organisms, and include transporters for sugars, oligosaccharides, phosphates,
nitrates, nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaryotes all have
a structure comprising 12 transmembrane segments (Pao, S.S. et al. (1998) Microbiol. Molec. Biol.
Rev. 62:1-34). The largest family of MFS transporters is the sugar transporter family, which
includes the seven glucose transporters (GLUT1-GLUT7) found in humans that are required for the
transport of glucose and other hexose sugars. These glucose transport proteins have unique tissue
distributions and physiological functions. GLUT1 provides many cell types with their basal glucose
requirements and transports glucose across epithelial and endothelial barrier tissues; GLUT2
facilitates glucose uptake or efflux from the liver; GLUT3 regulates glucose supply to neurons;
GLUT4 is responsible for insulin-regulated glucose disposal; and GLUT5 regulates fructose uptake
into skeletal muscle. Defects in glucose transporters are involved in a recently identified
neurological syndrome causing infantile seizures and developmental delay, as well as glycogen
storage disease, Fanconi-Bickel syndrome, and non-insulin-dependent diabetes mellitus (Mueckler,
M. (1994) Eur. J. Biochem. 219:713-725; Longo, N. and L.J. Elsas (1998) Adv. Pediatr. 45:293-313).

Monocarboxylate anion transporters are proton-coupled symporters with a broad substrate
specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and
beta-hydroxybutyrate. At least seven isoforms have been identified to date. The isoforms are
predicted to have twelve transmembrane (TM) helical domains with a large intracellular loop
between TM6 and TM7, and play a critical role in maintaining intracellular pH by removing the
protons that are produced stoichiometrically with lactate during glycolysis. The best characterized
H+-monocarboxylate transporter is that of the erythrocyte membrane, which transports L-lactate and
a wide range of other aliphatic monocarboxylates. Other cells possess H+-linked monocarboxylate
transporters with differing substrate and inhibitor selectivities. In particular, cardiac muscle and
tumor cells have transporters that differ in their Km values for certain substrates, including
stereoselectivity for L- over D-lactate, and in their sensitivity to inhibitors. There are
Na+-monocarboxylate cotransporters on the luminal surface of intestinal and kidney epithelia, which
allow the uptake of lactate, pyruvate, and ketone bodies in these tissues. In addition, there are
specific and selective transporters for organic cations and organic anions in organs including the
kidney, intestine and liver. Organic anion transporters are selective for hydrophobic, charged
molecules with electron-attracting side groups. Organic cation transporters, such as the ammonium
transporter, mediate the secretion of a variety of drugs and endogenous metabolites, and contribute
to the maintenance of intercellular pH (Poole, R.C. and A.P. Halestrap (1993) Am. J. Physiol.
264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 329:321-328; and Martinelle, K. and L.
Haggstrom (1993) J. Biotechnol. 30:339-350).

ATP-binding cassette (ABC) transporters are members of a superfamily of membrane
proteins that transport substances ranging from small molecules such as ions, sugars, amino acids,
peptides, and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC
transporters consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze
ATP to supply the energy required for transport, and two membrane-spanning domains (MSD), each
containing six putative transmembrane segments. These four modules may be encoded by a single
gene, as is the case for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes.
When encoded by separate genes, each gene product contains a single NBD and MSD. These
"half-molecules" form homo- and heterodimers, such as Tap1 and Tap2, the endoplasmic reticulum-based
major histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to
defects in ABC transporters, such as the following diseases and their corresponding proteins: cystic
fibrosis (CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP),
Zellweger syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic
hypoglycemia (sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR)
protein, another ABC transporter, in human cancer cells makes the cells resistant to a variety of
cytotoxic drugs used in chemotherapy (Taglicht, D. and S. Michaelis (1998) Meth. Enzymol.
292:130-162).

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum,
selenium, nickel, and chromium are important as cofactors for a number of enzymes. For example,
copper is involved in hemoglobin synthesis, connective tissue metabolism, and bone development,
by acting as a cofactor in oxidoreductases such as superoxide dismutase, ferroxidase
(ceruloplasmin), and lysyl oxidase. Copper and other metal ions must be provided in the diet, and
are absorbed by transporters in the gastrointestinal tract. Plasma proteins transport the metal ions to
the liver and other target organs, where specific transporters move the ions into cells and cellular
organelles as needed. Imbalances in metal ion metabolism have been associated with a number of
disease states (Danks, D.M. (1986) J. Med. Genet. 23:99-106).

P-type ATPases comprise a class of cation-transporting transmembrane proteins. They are
integral membrane proteins which use an aspartyl phosphate intermediate to move cations across a
membrane. Features of P-type ATPases include: (i) a cation channel; (ii) a stalk, formed by
extensions of the transmembrane alpha-helices into the cytoplasm; (iii) an ATP binding domain; (iv) a
phosphorylated aspartic acid; (v) an adjacent transduction domain; (vi) a phosphatase domain,
which removes the phosphate from the aspartic acid as part of the reaction cycle; and (vii) six or
more transmembrane domains. Included in this class are heavy metal-transporting ATPases as well
as aminophospholipid transporters.

The transport of phosphatidylserine and phosphatidylethanolamine by aminophospholipid
translocase results in the movement of these molecules from one side of a bilayer to another. This
transport is conducted by a newly identified subfamily of P-type ATPases which are proposed to be
amphipath transporters. Amphipath transporters move molecules having both a hydrophilic and a
hydrophobic region. As many as seventeen different genes belong to this P-type ATPase
subfamily, being grouped into several distinct classes and subclasses (Halleck, M.S. et al. (1999)
Physiol. Genomics 1:139-150; Vulpe, C. et al. (1993) Nat. Genet. 3:7-13).

Transport of fatty acids across the plasma membrane can occur by diffusion, a high
capacity, low affinity process. However, under normal physiological conditions a significant
fraction of fatty acid transport appears to occur via a high affinity, low capacity protein-mediated
transport process. Fatty acid transport protein (FATP), an integral membrane protein with
transmembrane segments, is expressed in tissues exhibiting high levels of plasma membrane fatty
acid flux, such as muscle, heart, and adipose. Expression of FATP is upregulated in 3T3-L1 cells
during adipose conversion, and expression in COS7 fibroblasts elevates uptake of long-chain fatty
acids (Hui, T.Y. et al. (1998) J. Biol. Chem. 273:27420-27429).

The lipocalin superfamily constitutes a phylogenetically conserved group of more than forty
proteins that function as extracellular ligand-binding proteins which bind and transport small
hydrophobic molecules. Members of this family function as carriers of retinoids, odorants,
chromophores, pheromones, allergens, and sterols, and in a variety of processes including nutrient
transport, cell growth regulation, immune response, and prostaglandin synthesis. A subset of these
proteins may be multifunctional, serving as either a biosynthetic enzyme or as a specific enzyme
inhibitor (Tanaka, T. et al. (1997) J. Biol. Chem. 272:15789-15795; and van't Hof, W. et al. (1997)
J. Biol. Chem. 272:1837-1841).

Members of the lipocalin family display unusually low levels of overall sequence
conservation. Pairwise sequence identity often falls below 20%. Sequence similarity between
family members is limited to conserved cysteines which form disulfide bonds and three motifs
which form a juxtaposed cluster that functions as a target cell recognition site. The lipocalins share
an eight stranded, anti-parallel beta-sheet which folds back on itself to form a continuously
hydrogen-bonded beta-barrel. The pocket formed by the barrel functions as an internal ligand
binding site. Seven loops (L1 to L7) form short beta-hairpins, except loop L1 which is a large
omega loop that forms a lid to partially close the internal ligand-binding site (Flower (1996)
Biochem. J. 318:1-14).
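
As an illustration of the pairwise sequence identity figure mentioned above, the following Python sketch computes percent identity from two pre-aligned sequences. It is only one of several conventions in use: here the denominator is assumed to be the number of aligned columns in which neither sequence has a gap. The function name and the toy sequences are hypothetical and are not real lipocalin sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns where neither sequence is gapped."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # gapped columns are excluded from the denominator
        compared += 1
        if res_a == res_b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Hypothetical toy alignment used only to exercise the function.
seq_a = "MKT-LLVAGC"
seq_b = "MRSALLIA-C"
identity = percent_identity(seq_a, seq_b)  # 5 matches over 8 compared columns
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, so reported identities are comparable only when the same denominator is used.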

Lipocalins are important transport molecules. Each lipocalin associates with a particular
ligand and delivers that ligand to appropriate target sites within the organism. Retinol-binding
protein (RBP), one of the best characterized lipocalins, transports retinol from stores within the liver
to target tissues. Apolipoprotein D (apo D), a component of high density lipoproteins (HDLs) and
low density lipoproteins (LDLs), functions in the targeted collection and delivery of cholesterol
throughout the body. Lipocalins are also involved in cell regulatory processes. Apo D, which is
identical to gross-cystic-disease-fluid protein (GCDFP)-24, is a progesterone/pregnenolone-binding
protein expressed at high levels in breast cyst fluid. Secretion of apo D in certain human breast
cancer cell lines is accompanied by reduced cell proliferation and progression of cells to a more
differentiated phenotype. Similarly, apo D and another lipocalin, alpha1-acid glycoprotein (AGP), are
involved in nerve cell regeneration. AGP is also involved in anti-inflammatory and
immunosuppressive activities. AGP is one of the positive acute-phase proteins (APP); circulating
levels of AGP increase in response to stress and inflammatory stimulation. AGP accumulates at
sites of inflammation where it inhibits platelet and neutrophil activation and inhibits phagocytosis.
The immunomodulatory properties of AGP are due to glycosylation. AGP is 40% carbohydrate,
making it unusually acidic and soluble. The glycosylation pattern of AGP changes during
acute-phase response, and deglycosylated AGP has no immunosuppressive activity (Flower (1994) FEBS
Lett. 354:7-11; Flower (1996) supra).

The lipocalin superfamily also includes several animal allergens, including the mouse major
urinary protein (mMUP), the rat alpha-2-microglobulin (rA2U), the bovine beta-lactoglobulin (beta-lg), the
cockroach allergen (Bla g4), bovine dander allergen (Bos d2), and the major horse allergen,
designated Equus caballus allergen 1 (Equ c1). Equ c1 is a powerful allergen responsible for about
80% of anti-horse IgE antibody response in patients who are chronically exposed to horse allergens.
It appears that lipocalins may contain a common structure that is able to induce the IgE response
(Gregoire, C. et al. (1996) J. Biol. Chem. 271:32951-32959).

Lipocalins are used as diagnostic and prognostic markers in a variety of disease states. The
plasma level of AGP is monitored during pregnancy and in diagnosis and prognosis of conditions
including cancer chemotherapy, renal dysfunction, myocardial infarction, arthritis, and multiple
sclerosis. RBP is used clinically as a marker of tubular reabsorption in the kidney, and apo D is a
marker in gross cystic breast disease (Flower (1996) supra). Additionally, the use of lipocalin
animal allergens may help in the diagnosis of allergic reactions to horses (Gregoire supra), pigs,
cockroaches, mice and rats.

Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions
and charged metabolites between the cytosol and the mitochondrial matrix. Examples include the
ADP, ATP carrier protein; the 2-oxoglutarate/malate carrier; the phosphate carrier protein; the
pyruvate carrier; the dicarboxylate carrier which transports malate, succinate, fumarate, and
phosphate; the tricarboxylate carrier which transports citrate and malate; and the Grave's disease
carrier protein, a protein recognized by IgG in patients with active Grave's disease, an autoimmune
disorder resulting in hyperthyroidism. Proteins in this family consist of three tandem repeats of an
approximately 100 amino acid domain, each of which contains two transmembrane regions (Stryer,
L. (1995) Biochemistry, W.H. Freeman and Company, New York NY, p. 551; PROSITE
PDOC00189 Mitochondrial energy transfer proteins signature; Online Mendelian Inheritance in
Man (OMIM) *275000 Graves Disease).

This class of transporters also includes the mitochondrial uncoupling proteins, which create
proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation
from ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling
proteins have been implicated as modulators of thermoregulation and metabolic rate, and have been
proposed as potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et al.
(1999) J. Int. Med. 245:637-642).
Ion Channels

The electrical potential of a cell is generated and maintained by controlling the movement
of ions across the plasma membrane. The movement of ions requires ion channels, which form
ion-selective pores within the membrane. There are two basic types of ion channels, ion transporters
and gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively
transport an ion against the ion's concentration gradient. Gated ion channels allow passive flow of
an ion down the ion's electrochemical gradient under restricted conditions. Together, these types of
ion channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical
impulse conduction down the axon of a nerve cell, 2) transport of molecules into cells against
concentration gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion.
Ion Transporters 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the
energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient.
These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion
transporters, including Na+-K+ ATPase, Ca2+-ATPase, and H+-ATPase, are activated by a
phosphorylation event. P-class ion transporters, also known as E1-E2 type ATPases, are responsible
for maintaining resting potential distributions such that cytosolic concentrations of Na+ and Ca2+ are
low and cytosolic concentration of K+ is high. The vacuolar (V) class of ion transporters includes
H+ pumps on intracellular organelles, such as lysosomes and Golgi. V-class ion transporters are
responsible for generating the low pH within the lumen of these organelles that is required for
function. The coupling factor (F) class consists of H+ pumps in the mitochondria. F-class ion
transporters utilize a proton gradient to generate ATP from ADP and inorganic phosphate (Pi).

The P-ATPases are hexamers of a 100 kD subunit with ten transmembrane domains and
several large cytoplasmic regions that may play a role in ion binding (Scarborough, G.A. (1999)
Curr. Opin. Cell Biol. 11:517-522). The V-ATPases are composed of two functional domains: the
V1 domain, a peripheral complex responsible for ATP hydrolysis; and the V0 domain, an integral
complex responsible for proton translocation across the membrane. The F-ATPases are structurally
and evolutionarily related to the V-ATPases. The F-ATPase F0 domain contains 12 copies of the c
subunit, a highly hydrophobic protein composed of two transmembrane domains and containing a
single buried carboxyl group in TM2 that is essential for proton transport. The V-ATPase V0
domain contains three types of homologous c subunits with four or five transmembrane domains
and the essential carboxyl group in TM4 or TM3. Both types of complex also contain a single a
subunit that may be involved in regulating the pH dependence of activity (Forgac, M. (1999) J.
Biol. Chem. 274:12951-12954).

The resting potential of the cell is utilized in many processes involving carrier proteins and
gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out
of the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport
(symport) so that the movement of Na+ down an electrochemical gradient drives transport of the
other molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca2+ out of
the cell with transport of Na+ into the cell (antiport).

Gated Ion Channels

Gated ion channels control ion flow by regulating the opening and closing of pores. The
ability to control ion flux through various gating mechanisms allows ion channels to mediate such
diverse signaling and homeostatic functions as neuronal and endocrine signaling, muscle
contraction, fertilization, and regulation of ion and pH balance. Gated ion channels are categorized
according to the manner of regulating the gating function. Mechanically-gated channels open their
pores in response to mechanical stress; voltage-gated channels (e.g., Na+, K+, Ca2+, and Cl- channels)
open their pores in response to changes in membrane potential; and ligand-gated channels (e.g.,
acetylcholine-, serotonin-, and glutamate-gated cation channels, and GABA- and glycine-gated
chloride channels) open their pores in the presence of a specific ion, nucleotide, or neurotransmitter.
The gating properties of a particular ion channel (i.e., its threshold for and duration of opening and
closing) are sometimes modulated by association with auxiliary channel proteins and/or
post-translational modifications, such as phosphorylation.

Mechanically-gated or mechanosensitive ion channels act as transducers for the senses of
touch, hearing, and balance, and also play important roles in cell volume regulation, smooth muscle
contraction, and cardiac rhythm generation. A stretch-inactivated channel (SIC) was recently
cloned from rat kidney. The SIC channel belongs to a group of channels which are activated by
pressure or stress on the cell membrane and conduct both Ca2+ and Na+ (Suzuki, M. et al. (1999) J.
Biol. Chem. 274:6330-6335).

The pore-forming subunits of the voltage-gated cation channels form a superfamily of ion
channel proteins. The characteristic domain of these channel proteins comprises six transmembrane
domains (S1-S6), a pore-forming region (P) located between S5 and S6, and intracellular amino and
carboxy termini. In the Na+ and Ca2+ subfamilies, this domain is repeated four times, while in the K+
channel subfamily, each channel is formed from a tetramer of either identical or dissimilar
subunits. The P region contains information specifying the ion selectivity. In the
case of K+ channels, a GYG tripeptide is involved in this selectivity (Ishii, T.M. et al. (1997) Proc.
Natl. Acad. Sci. USA 94:11651-11656).

Voltage-gated Na+ and K+ channels are necessary for the function of electrically excitable
cells, such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and
muscle contraction, arise from large, transient changes in the permeability of the membrane to Na+
and K+ ions. Depolarization of the membrane beyond the threshold level opens voltage-gated Na+
channels. Sodium ions flow into the cell, further depolarizing the membrane and opening more
voltage-gated Na+ channels, which propagates the depolarization down the length of the cell.
Depolarization also opens voltage-gated potassium channels. Consequently, potassium ions flow
outward, which leads to repolarization of the membrane. Voltage-gated channels utilize charged
residues in the fourth transmembrane segment (S4) to sense voltage change. The open state lasts
only about 1 millisecond, at which time the channel spontaneously converts into an inactive state
that cannot be opened irrespective of the membrane potential. Inactivation is mediated by the
channel's N-terminus, which acts as a plug that closes the pore. The transition from an inactive to a
closed state requires a return to resting potential.
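
The closed, open, and inactive states described above can be sketched as a small state machine in Python. This is an illustrative simplification under stated assumptions: the threshold and resting potentials are placeholder values not given in the text, the roughly 1 millisecond open time is taken from the paragraph above, and the class and constant names are hypothetical.

```python
from enum import Enum


class State(Enum):
    CLOSED = "closed"
    OPEN = "open"
    INACTIVE = "inactive"


THRESHOLD_MV = -55.0    # assumed threshold potential (placeholder value)
RESTING_MV = -70.0      # assumed resting potential (placeholder value)
OPEN_DURATION_MS = 1.0  # "about 1 millisecond" open time from the text


class VoltageGatedChannel:
    """Toy three-state model of a single voltage-gated channel."""

    def __init__(self) -> None:
        self.state = State.CLOSED
        self.time_open_ms = 0.0

    def step(self, membrane_mv: float, dt_ms: float) -> State:
        if self.state is State.CLOSED:
            # Depolarization beyond threshold opens the channel.
            if membrane_mv > THRESHOLD_MV:
                self.state = State.OPEN
                self.time_open_ms = 0.0
        elif self.state is State.OPEN:
            # The open state converts spontaneously to the inactive state.
            self.time_open_ms += dt_ms
            if self.time_open_ms >= OPEN_DURATION_MS:
                self.state = State.INACTIVE
        elif self.state is State.INACTIVE:
            # Only a return to resting potential restores the closed state.
            if membrane_mv <= RESTING_MV:
                self.state = State.CLOSED
        return self.state


channel = VoltageGatedChannel()
voltages_mv = (-70.0, -40.0, -40.0, -40.0, 20.0, -70.0, -40.0)
trace = [channel.step(v, 0.5).value for v in voltages_mv]
```

In this sketch the strongly depolarized step (20 mV) does not reopen the channel while it is inactive, which mirrors the statement that the inactive state cannot be opened irrespective of the membrane potential.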

Voltage-gated Na+ channels are heterotrimeric complexes composed of a 260 kDa
pore-forming alpha subunit that associates with two smaller auxiliary subunits, beta1 and beta2. The beta2 subunit is
an integral membrane glycoprotein that contains an extracellular Ig domain, and its association with alpha
and beta1 subunits correlates with increased functional expression of the channel, a change in its
gating properties, as well as an increase in whole cell capacitance due to an increase in membrane
surface area (Isom, L.L. et al. (1995) Cell 83:433-442).

Non voltage-gated Na+ channels include the members of the amiloride-sensitive Na+
channel/degenerin (NaC/DEG) family. Channel subunits of this family are thought to consist of two
transmembrane domains flanking a long extracellular loop, with the amino and carboxyl termini
located within the cell. The NaC/DEG family includes the epithelial Na+ channel (ENaC) involved
in Na+ reabsorption in epithelia including the airway, distal colon, cortical collecting duct of the
kidney, and exocrine duct glands. Mutations in ENaC result in pseudohypoaldosteronism type 1
and Liddle's syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the
recently characterized H+-gated cation channels or acid-sensing ion channels (ASIC). ASIC
subunits are expressed in the brain and form heteromultimeric Na+-permeable channels. These
channels require acid pH fluctuations for activation. ASIC subunits show homology to the
degenerins, a family of mechanically-gated channels originally isolated from C. elegans. Mutations
in the degenerins cause neurodegeneration. ASIC subunits may also have a role in neuronal
function, or in pain perception, since tissue acidosis causes pain (Waldmann, R. and M. Lazdunski
(1998) Curr. Opin. Neurobiol. 8:418-424; Eglen, R.M. et al. (1999) Trends Pharmacol. Sci. 20:337-342).

K+ channels are located in all cell types, and may be regulated by voltage, ATP
concentration, or second messengers such as Ca2+ and cAMP. In non-excitable tissue, K+ channels
are involved in protein synthesis, control of endocrine secretions, and the maintenance of osmotic
equilibrium across membranes. In neurons and other excitable cells, in addition to regulating action
potentials and repolarizing membranes, K+ channels are responsible for setting the resting
membrane potential. The cytosol contains non-diffusible anions and, to balance this net negative
charge, the cell contains a Na+-K+ pump and ion channels that provide the redistribution of Na+, K+,
and Cl-. The pump actively transports Na+ out of the cell and K+ into the cell in a 3:2 ratio. Ion
channels in the plasma membrane allow K+ and Cl- to flow by passive diffusion. Because of the
high negative charge within the cytosol, Cl- flows out of the cell. The flow of K+ is balanced by an
electromotive force pulling K+ into the cell, and a K+ concentration gradient pushing K+ out of the
cell. Thus, the resting membrane potential is primarily regulated by K+ flow (Salkoff, L. and T.
Jegla (1995) Neuron 15:489-492).
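
The balance between the electrical force and the concentration gradient described above is commonly quantified with the Nernst equation, E = (RT/zF) ln([ion]out/[ion]in). The equation is standard electrophysiology and is not stated in the text; the Python sketch below applies it with assumed, textbook-style concentrations that are placeholders rather than values from this disclosure.

```python
import math

R = 8.314462618  # gas constant, J/(mol*K)
F = 96485.33212  # Faraday constant, C/mol


def nernst_mv(conc_out_mm: float, conc_in_mm: float, valence: int,
              temp_c: float = 37.0) -> float:
    """Equilibrium (Nernst) potential in millivolts for a single ion species."""
    temp_k = temp_c + 273.15
    return 1000.0 * (R * temp_k) / (valence * F) * math.log(conc_out_mm / conc_in_mm)


# Assumed illustrative concentrations in mM (not taken from the source).
e_k = nernst_mv(conc_out_mm=5.0, conc_in_mm=140.0, valence=1)
e_na = nernst_mv(conc_out_mm=145.0, conc_in_mm=12.0, valence=1)

# Net charge moved outward per pump cycle for the 3:2 ratio given in the text.
net_charge_out_per_cycle = 3 - 2
```

With these assumed concentrations the K+ equilibrium potential is strongly negative (near -89 mV) while the Na+ potential is positive, which is consistent with a resting potential dominated by K+ flow.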

Potassium channel subunits of the Shaker-like superfamily all have the characteristic six
transmembrane/1 pore domain structure. Four subunits combine as homo- or heterotetramers to
form functional K channels. These pore-forming subunits also associate with various cytoplasmic beta
subunits that alter channel inactivation kinetics. The Shaker-like channel family includes the
voltage-gated K+ channels as well as the delayed rectifier type channels such as the human
ether-a-go-go related gene (HERG) associated with long QT, a cardiac dysrhythmia syndrome (Curran, M.E.
(1998) Curr. Opin. Biotechnol. 9:565-572; Kaczorowski, G.J. and M.L. Garcia (1999) Curr. Opin.
Chem. Biol. 3:448-458).

A second superfamily of K+ channels is composed of the inward rectifying channels (Kir).
Kir channels have the property of preferentially conducting K+ currents in the inward direction.
These proteins consist of a single potassium selective pore domain and two transmembrane
domains, which correspond to the fifth and sixth transmembrane domains of voltage-gated K+
channels. Kir subunits also associate as tetramers. The Kir family includes ROMK1, mutations in
which lead to Bartter syndrome, a renal tubular disorder. Kir channels are also involved in
regulation of cardiac pacemaker activity, seizures and epilepsy, and insulin regulation (Doupnik,
C.A. et al. (1995) Curr. Opin. Neurobiol. 5:268-277; Curran, supra).

The recently recognized TWIK K+ channel family includes the mammalian TWIK-1,
TREK-1 and TASK proteins. Members of this family possess an overall structure with four
transmembrane domains and two P domains. These proteins are probably involved in controlling
the resting potential in a large set of cell types (Duprat, F. et al. (1997) EMBO J. 16:5464-5471).

The voltage-gated Ca2+ channels have been classified into several subtypes based upon their
electrophysiological and pharmacological characteristics. L-type Ca2+ channels are predominantly
expressed in heart and skeletal muscle where they play an essential role in excitation-contraction
coupling. T-type channels are important for cardiac pacemaker activity, while N-type and P/Q-type
channels are involved in the control of neurotransmitter release in the central and peripheral nervous
system. The L-type and N-type voltage-gated Ca2+ channels have been purified and, though their
functions differ dramatically, they have similar subunit compositions. The channels are composed
of three subunits. The alpha1 subunit forms the membrane pore and voltage sensor, while the alpha2delta and beta
subunits modulate the voltage-dependence, gating properties, and the current amplitude of the
channel. These subunits are encoded by at least six alpha1, one alpha2delta, and four beta genes. A fourth subunit,
gamma, has been identified in skeletal muscle (Walker, D. et al. (1998) J. Biol. Chem. 273:2361-2367;
McCleskey, E.W. (1994) Curr. Opin. Neurobiol. 4:304-312).

The high-voltage-activated Ca2+ channels that have been characterized biochemically
include complexes of a pore-forming alpha1 subunit of approximately 190-250 kDa; a
transmembrane complex of alpha2 and delta subunits; an intracellular beta subunit; and in some
cases a transmembrane gamma subunit. A variety of alpha1 subunits, alpha2delta complexes, beta
subunits, and gamma subunits are known. The Cav1 family of alpha1 subunits conduct L-type Ca2+
currents, which initiate muscle contraction, endocrine secretion, and gene transcription, and are
regulated primarily by second messenger-activated protein phosphorylation pathways. The Cav2
family of alpha1 subunits conduct N-type, P/Q-type, and R-type Ca2+ currents, which initiate rapid
synaptic transmission and are regulated primarily by direct interaction with G proteins and SNARE
proteins and secondarily by protein phosphorylation. The Cav3 family of alpha1 subunits conduct
T-type Ca2+ currents, which are activated and inactivated more rapidly and at more negative
membrane potentials than other Ca2+ current types. The distinct structures and patterns of
regulation of these three families of Ca2+ channels provide an array of Ca2+ entry pathways in
response to changes in membrane potential and a range of possibilities for regulation of Ca2+ entry
by second messenger pathways and interacting proteins (Catterall, W.A. (2000) Annu. Rev. Cell
Dev. Biol. 16:521-555).

The alpha-2 subunit of the voltage-gated Ca2+ channel may include one or more Cache
domains. An extracellular Cache domain may be fused to an intracellular catalytic domain, such as
the histidine kinase, PP2C phosphatase, GGDEF (a predicted diguanylate cyclase), HD-GYP (a
predicted phosphodiesterase) or adenylyl cyclase domain, or to a noncatalytic domain, like the
methyl-accepting, DNA-binding winged helix-turn-helix, GAF, PAS or HAMP (a domain found in
histidine kinases, adenylyl cyclases, methyl-binding proteins and phosphatases). Small molecules are
bound via the Cache domain and this signal is converted into diverse outputs depending on the
intracellular domains (Anantharaman, V. and Aravind, L. (2000) Trends Biochem. Sci. 25:535-537).

The transient receptor family (Trp) of calcium ion channels are thought to mediate
capacitative calcium entry (CCE). CCE is the Ca2+ influx into cells to resupply Ca2+ stores depleted
by the action of inositol triphosphate (IP3) and other agents in response to numerous hormones and
growth factors. Trp and Trp-like were first cloned from Drosophila and have similarity to
voltage-gated Ca2+ channels in the S3 through S6 regions. This suggests that Trp and/or related proteins
may form mammalian CCE channels (Zhu, X. et al. (1996) Cell 85:661-671; Boulay, G. et al. (1997)
J. Biol. Chem. 272:29672-29680). Melastatin is a gene isolated in both the mouse and human,
whose expression in melanoma cells is inversely correlated with melanoma aggressiveness in vivo.
The human cDNA transcript corresponds to a 1533-amino acid protein having homology to
members of the Trp family. It has been proposed that the combined use of melastatin mRNA
expression status and tumor thickness might allow for the determination of subgroups of patients at
both low and high risk for developing metastatic disease (Duncan, L.M. et al. (2001) J. Clin. Oncol.
19:568-576).

Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and
organelle pH. In secretory epithelial cells, Cl- enters the cell across a basolateral membrane through
an Na+, K+/Cl- cotransporter, accumulating in the cell above its electrochemical equilibrium
concentration. Secretion of Cl- from the apical surface, in response to hormonal stimulation, leads
to flow of Na+ and water into the secretory lumen. The cystic fibrosis transmembrane conductance
regulator (CFTR) is a chloride channel encoded by the gene for cystic fibrosis, a common fatal
genetic disorder in humans. CFTR is a member of the ABC transporter family, and is composed of
two domains each consisting of six transmembrane domains followed by a nucleotide-binding site.
Loss of CFTR function decreases transepithelial water secretion and, as a result, the layers of mucus
that coat the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear.
The resulting blockage of these sites leads to pancreatic insufficiency, "meconium ileus", and
devastating "chronic obstructive pulmonary disease" (Al-Awqati, Q. et al. (1992) J. Exp. Biol.
172:245-266).

The voltage-gated chloride channels (CLC) are characterized by 10-12 transmembrane
domains, as well as two small globular domains known as CBS domains. The CLC subunits
probably function as homotetramers. CLC proteins are involved in regulation of cell volume,
membrane potential stabilization, signal transduction, and transepithelial transport. Mutations in
CLC-1, expressed predominantly in skeletal muscle, are responsible for autosomal recessive
generalized myotonia and autosomal dominant myotonia congenita, while mutations in the kidney
channel CLC-5 lead to kidney stones (Jentsch, T.J. (1996) Curr. Opin. Neurobiol. 6:303-310).

Ligand-gated channels open their pores when an extracellular or intracellular mediator
binds to the channel. Neurotransmitter-gated channels are channels that open when a
neurotransmitter binds to their extracellular domain. These channels exist in the postsynaptic
membrane of nerve or muscle cells. There are two types of neurotransmitter-gated channels.
Sodium channels open in response to excitatory neurotransmitters, such as acetylcholine, glutamate,
and serotonin. This opening causes an influx of Na+ and produces the initial localized
depolarization that activates the voltage-gated channels and starts the action potential. Chloride
channels open in response to inhibitory neurotransmitters, such as γ-aminobutyric acid (GABA) and
glycine, leading to hyperpolarization of the membrane and the subsequent generation of an action
potential. Neurotransmitter-gated ion channels have four transmembrane domains and probably
function as pentamers (Jentsch, supra). Amino acids in the second transmembrane domain appear to
be important in determining channel permeation and selectivity (Sather, W.A. et al. (1994) Curr.
Opin. Neurobiol. 4:313-323).

Ligand-gated channels can be regulated by intracellular second messengers. For example,
calcium-activated K+ channels are gated by internal calcium ions. In nerve cells, an influx of
calcium during depolarization opens channels to modulate the magnitude of the action potential
(Ishi et al., supra). The large conductance (BK) channel has been purified from brain and its subunit
composition determined. The α subunit of the BK channel has seven rather than six transmembrane
domains in contrast to voltage-gated K+ channels. The extra transmembrane domain is located at
the subunit N-terminus. A 28-amino-acid stretch in the C-terminal region of the subunit (the
"calcium bowl" region) contains many negatively charged residues and is thought to be the region
responsible for calcium binding. The β subunit consists of two transmembrane domains connected
by a glycosylated extracellular loop, with intracellular N- and C-termini (Kaczorowski, supra;
Vergara, C. et al. (1998) Curr. Opin. Neurobiol. 8:321-329).

Cyclic nucleotide-gated (CNG) channels are gated by cytosolic cyclic nucleotides. The best
examples of these are the cAMP-gated Na+ channels involved in olfaction and the cGMP-gated
cation channels involved in vision. Both systems involve ligand-mediated activation of a G-protein
coupled receptor which then alters the level of cyclic nucleotide within the cell. CNG channels also
represent a major pathway for Ca2+ entry into neurons, and play roles in neuronal development and
plasticity. CNG channels are tetramers containing at least two types of subunits, an α subunit which
can form functional homomeric channels, and a β subunit, which modulates the channel properties.
All CNG subunits have six transmembrane domains and a pore forming region between the fifth and
sixth transmembrane domains, similar to voltage-gated channels. A large C-terminal domain
contains a cyclic nucleotide binding domain, while the N-terminal domain confers variation among
channel subtypes (Zufall, F. et al. (1997) Curr. Opin. Neurobiol. 7:404-412).

The activity of other types of ion channel proteins may also be modulated by a variety of
intracellular signalling proteins. Many channels have sites for phosphorylation by one or more
protein kinases including protein kinase A, protein kinase C, tyrosine kinase, and casein kinase II,
all of which regulate ion channel activity in cells. Kir channels are activated by the binding of the
Gβγ subunits of heterotrimeric G-proteins (Reimann, F. and F.M. Ashcroft (1999) Curr. Opin. Cell.
Biol. 11:503-508). Other proteins are involved in the localization of ion channels to specific sites in
the cell membrane. Such proteins include the PDZ domain proteins known as MAGUKs
(membrane-associated guanylate kinases) which regulate the clustering of ion channels at neuronal
synapses (Craven, S.E. and D.S. Bredt (1998) Cell 93:495-498).
Disease Correlation

The etiology of numerous human diseases and disorders can be attributed to defects in the
transport of molecules across membranes. Defects in the trafficking of membrane-bound
transporters and ion channels are associated with several disorders, e.g., cystic fibrosis,
glucose-galactose malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain
forms of diabetes mellitus. Single-gene defect diseases resulting in an inability to transport small
molecules across membranes include, e.g., cystinuria, iminoglycinuria, Hartnup disease, and Fanconi
disease (van't Hoff, W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern.
Med. 120:218-226; and Chillon, M. et al. (1995) New Engl. J. Med. 332:1475-1480).

Human diseases caused by mutations in ion channel genes include disorders of skeletal
muscle, cardiac muscle, and the central nervous system. Mutations in the pore-forming subunits of
sodium and chloride channels cause myotonia, a muscle disorder in which relaxation after voluntary
contraction is delayed. Sodium channel myotonias have been treated with channel blockers.
Mutations in muscle sodium and calcium channels cause forms of periodic paralysis, while
mutations in the sarcoplasmic calcium release channel, T-tubule calcium channel, and muscle
sodium channel cause malignant hyperthermia. Cardiac arrhythmia disorders such as the long QT
syndromes and idiopathic ventricular fibrillation are caused by mutations in potassium and sodium
channels (Cooper, E.C. and L.Y. Jan (1998) Proc. Natl. Acad. Sci. USA 96:4759-4766). All four
known human idiopathic epilepsy genes code for ion channel proteins (Berkovic, S.F. and I.E.
Scheffer (1999) Curr. Opin. Neurology 12:177-182). Other neurological disorders such as ataxias,
hemiplegic migraine and hereditary deafness can also result from mutations in ion channel genes
(Jen, J. (1999) Curr. Opin. Neurobiol. 9:274-280; Cooper, supra).

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia.
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head
trauma, and neurodegenerative disease (Taylor, C.P. and L.S. Narasimhan (1997) Adv. Pharmacol.
39:47-98). Various classes of ion channels also play an important role in the perception of pain, and
thus are potential targets for new analgesics. These include the vanilloid-gated ion channels, which
are activated by the vanilloid capsaicin, as well as by noxious heat. Local anesthetics such as
lidocaine and mexiletine which blockade voltage-gated Na+ channels have been useful in the
treatment of neuropathic pain (Eglen, supra). Calcium-channel protein expression is altered in
metastatic melanomas (Enklaar, T. et al. (2000) Genomics 67:179-187).

Ion channels in the immune system have recently been suggested as targets for
immunomodulation. T-cell activation depends upon calcium signaling, and a diverse set of T-cell
specific ion channels has been characterized that affect this signaling process. Channel blocking
agents can inhibit secretion of lymphokines, cell proliferation, and killing of target cells. A peptide
antagonist of the T-cell potassium channel Kv1.3 was found to suppress delayed-type
hypersensitivity and allogenic responses in pigs, validating the idea of channel blockers as safe and
efficacious immunosuppressants (Cahalan, M.D. and K.G. Chandy (1997) Curr. Opin. Biotechnol.
8:749-756).

The discovery of new transporters and ion channels, and the polynucleotides encoding
them, satisfies a need in the art by providing new compositions which are useful in the diagnosis,
prevention, and treatment of transport, neurological, muscle, immunological and cell proliferative
disorders, and in the assessment of the effects of exogenous compounds on the expression of nucleic
acid and amino acid sequences of transporters and ion channels.

SUMMARY OF THE INVENTION

The invention features purified polypeptides, transporters and ion channels, referred to
collectively as "TRICH" and individually as "TRICH-1," "TRICH-2," "TRICH-3," "TRICH-4,"
"TRICH-5," "TRICH-6," "TRICH-7," "TRICH-8," "TRICH-9," "TRICH-10," "TRICH-11,"
"TRICH-12," "TRICH-13," "TRICH-14," "TRICH-15," "TRICH-16," "TRICH-17," "TRICH-18,"
"TRICH-19," "TRICH-20," "TRICH-21," "TRICH-22," "TRICH-23," "TRICH-24," "TRICH-25,"
"TRICH-26," "TRICH-27," "TRICH-28," "TRICH-29," "TRICH-30," "TRICH-31," and "TRICH-32."
In one aspect, the invention provides an isolated polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32, b) a polypeptide comprising a naturally occurring amino acid
sequence at least 90% identical to an amino acid sequence selected from the group consisting of
SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-32, and d) an immunogenic fragment of a
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32.
In one alternative, the invention provides an isolated polypeptide comprising the amino acid
sequence of SEQ ID NO:1-32.
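
The "at least 90% identical" criterion in b) can be illustrated with a small calculation. The sketch below is only an illustration: it assumes the two sequences have already been aligned (gaps written as "-") and that identity is counted as matching columns divided by compared columns. The function names and the toy sequences are hypothetical and are not taken from the sequence listing; the programs and parameters actually used for analysis are those summarized in Table 7.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumptions (not from the specification): gaps are written as '-',
    columns where both sequences have a gap are ignored, and a gap
    opposite a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # column carries no information
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned positions to compare")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 90.0) -> bool:
    """True when the pair reaches the stated identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical 10-residue toy sequences differing at one position.
reference = "MKTLLVAGSA"
variant = "MKTLLVAGSV"
print(percent_identity(reference, variant))
print(meets_identity_threshold(reference, variant))
```

A real comparison would first produce the alignment with an established tool; the denominator convention chosen here is one of several in common use and changes the reported percentage.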

The invention further provides an isolated polynucleotide encoding a polypeptide selected
from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NO:1-32, b) a polypeptide comprising a naturally occurring amino acid
sequence at least 90% identical to an amino acid sequence selected from the group consisting of
SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-32, and d) an immunogenic fragment of a
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32.
In one alternative, the polynucleotide encodes a polypeptide selected from the group consisting of
SEQ ID NO:1-32. In another alternative, the polynucleotide is selected from the group consisting of
SEQ ID NO:33-64.

Additionally, the invention provides a recombinant polynucleotide comprising a promoter
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32, b) a polypeptide comprising a naturally occurring amino acid
sequence at least 90% identical to an amino acid sequence selected from the group consisting of
SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-32, and d) an immunogenic fragment of a
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32.
In one alternative, the invention provides a cell transformed with the recombinant polynucleotide.
In another alternative, the invention provides a transgenic organism comprising the recombinant
polynucleotide.

The invention also provides a method for producing a polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32, b) a polypeptide comprising a naturally occurring amino acid
sequence at least 90% identical to an amino acid sequence selected from the group consisting of
SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-32, and d) an immunogenic fragment of a
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32.
The method comprises a) culturing a cell under conditions suitable for expression of the
polypeptide, wherein said cell is transformed with a recombinant polynucleotide comprising a
promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) recovering
the polypeptide so expressed.

Additionally, the invention provides an isolated antibody which specifically binds to a
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid
sequence selected from the group consisting of SEQ ID NO:1-32, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32.

The invention further provides an isolated polynucleotide selected from the group
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group
consisting of SEQ ID NO:33-64, b) a polynucleotide comprising a naturally occurring
polynucleotide sequence at least 90% identical to a polynucleotide sequence selected from the
group consisting of SEQ ID NO:33-64, c) a polynucleotide complementary to the polynucleotide of
a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA equivalent of
a)-d). In one alternative, the polynucleotide comprises at least 60 contiguous nucleotides.
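
Items c) through e) above refer to complementary polynucleotides and RNA equivalents. As a rough illustration only, the sketch below derives both from a DNA string. It assumes that "complementary" means the Watson-Crick reverse complement written 5' to 3', and that an "RNA equivalent" has the same linear sequence with thymine replaced by uracil; both readings are assumptions for this example, and the function names and the toy sequence are hypothetical rather than drawn from SEQ ID NO:33-64.

```python
# Watson-Crick pairing for unambiguous DNA bases only (assumption: no IUPAC
# ambiguity codes appear in the input).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))


def rna_equivalent(dna: str) -> str:
    """Return the same linear sequence with thymine replaced by uracil."""
    return dna.upper().replace("T", "U")


# Hypothetical 10-nucleotide toy sequence.
toy = "ATGGCCAAGT"
print(reverse_complement(toy))  # ACTTGGCCAT
print(rna_equivalent(toy))      # AUGGCCAAGU
```

Applying `reverse_complement` twice returns the original sequence, which is a quick sanity check on the pairing table.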

Additionally, the invention provides a method for detecting a target polynucleotide in a
sample, said target polynucleotide having a sequence of a polynucleotide selected from the group
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group
consisting of SEQ ID NO:33-64, b) a polynucleotide comprising a naturally occurring
polynucleotide sequence at least 90% identical to a polynucleotide sequence selected from the
group consisting of SEQ ID NO:33-64, c) a polynucleotide complementary to the polynucleotide of
a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA equivalent of
a)-d). The method comprises a) hybridizing the sample with a probe comprising at least 20 contiguous
nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and
which probe specifically hybridizes to said target polynucleotide, under conditions whereby a
hybridization complex is formed between said probe and said target polynucleotide or fragments
thereof, and b) detecting the presence or absence of said hybridization complex, and optionally, if
present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous
nucleotides.

The invention further provides a method for detecting a target polynucleotide in a sample,
said target polynucleotide having a sequence of a polynucleotide selected from the group consisting
of a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of
SEQ ID NO:33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at
least 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID
NO:33-64, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain
reaction amplification, and b) detecting the presence or absence of said amplified target
polynucleotide or fragment thereof, and, optionally, if present, the amount thereof.

The invention further provides a composition comprising an effective amount of a
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid
sequence selected from the group consisting of SEQ ID NO:1-32, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32, and a pharmaceutically acceptable excipient. In one embodiment,
the composition comprises an amino acid sequence selected from the group consisting of SEQ ID
NO:1-32. The invention additionally provides a method of treating a disease or condition associated
with decreased expression of functional TRICH, comprising administering to a patient in need of
such treatment the composition.

The invention also provides a method for screening a compound for effectiveness as an
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO:1-32, b) a polypeptide comprising
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32. The method comprises a) exposing a sample comprising the
polypeptide to a compound, and b) detecting agonist activity in the sample. In one alternative, the
invention provides a composition comprising an agonist compound identified by the method and a
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of
treating a disease or condition associated with decreased expression of functional TRICH,
comprising administering to a patient in need of such treatment the composition.

Additionally, the invention provides a method for screening a compound for effectiveness
as an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising
an amino acid sequence selected from the group consisting of SEQ ID NO:1-32, b) a polypeptide
comprising a naturally occurring amino acid sequence at least 90% identical to an amino acid
sequence selected from the group consisting of SEQ ID NO:1-32, c) a biologically active fragment
of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID
NO:1-32, and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected
from the group consisting of SEQ ID NO:1-32. The method comprises a) exposing a sample
comprising the polypeptide to a compound, and b) detecting antagonist activity in the sample. In
one alternative, the invention provides a composition comprising an antagonist compound identified
by the method and a pharmaceutically acceptable excipient. In another alternative, the invention
provides a method of treating a disease or condition associated with overexpression of functional
TRICH, comprising administering to a patient in need of such treatment the composition.

The invention further provides a method of screening for a compound that specifically
binds to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO:1-32, b) a polypeptide comprising
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32. The method comprises a) combining the polypeptide with at least
one test compound under suitable conditions, and b) detecting binding of the polypeptide to the test
compound, thereby identifying a compound that specifically binds to the polypeptide.

The invention further provides a method of screening for a compound that modulates the
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO:1-32, b) a polypeptide comprising
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-32, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-32, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32. The method comprises a) combining the polypeptide with at least
one test compound under conditions permissive for the activity of the polypeptide, b) assessing the
activity of the polypeptide in the presence of the test compound, and c) comparing the activity of
the polypeptide in the presence of the test compound with the activity of the polypeptide in the
absence of the test compound, wherein a change in the activity of the polypeptide in the presence of
the test compound is indicative of a compound that modulates the activity of the polypeptide.
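
The comparison in step c) of the preceding method can be pictured as a simple ratio of measured activities. The following is a minimal sketch under stated assumptions: activity readings are positive numbers in arbitrary units, replicate readings are averaged, and a hypothetical fold-change cutoff of 1.5 decides whether a change is called. Neither the cutoff nor the function name comes from the specification, and a real screen would add controls and a statistical test.

```python
from statistics import mean


def classify_modulation(with_compound, without_compound, min_fold_change=1.5):
    """Compare mean activity with and without a test compound.

    Assumptions (illustrative only): readings are in arbitrary units,
    the control mean is positive, and min_fold_change is a hypothetical
    cutoff rather than a value taken from the specification.
    """
    treated = mean(with_compound)
    control = mean(without_compound)
    if control <= 0 or treated < 0:
        raise ValueError("activity readings must be non-negative, control positive")
    ratio = treated / control
    if ratio >= min_fold_change:
        return "increases activity"
    if ratio <= 1.0 / min_fold_change:
        return "decreases activity"
    return "no change detected"


# Hypothetical replicate readings.
print(classify_modulation([30, 32, 28], [10, 11, 9]))
print(classify_modulation([5, 5], [10, 10]))
print(classify_modulation([10], [10]))
```

Under this sketch a compound that raises activity would be a candidate agonist and one that lowers it a candidate antagonist, subject to confirmation in an appropriate assay.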

The invention further provides a method for screening a compound for effectiveness in
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a
polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, the method
comprising a) exposing a sample comprising the target polynucleotide to a compound, b) detecting
altered expression of the target polynucleotide, and c) comparing the expression of the target
polynucleotide in the presence of varying amounts of the compound and in the absence of the
compound.

The invention further provides a method for assessing toxicity of a test compound, said
method comprising a) treating a biological sample containing nucleic acids with the test compound;
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, ii)
a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to
a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, iii) a
polynucleotide having a sequence complementary to i), iv) a polynucleotide complementary to the
polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions
whereby a specific hybridization complex is formed between said probe and a target polynucleotide
in the biological sample, said target polynucleotide selected from the group consisting of i) a
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID
NO:33-64, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64,
iii) a polynucleotide complementary to the polynucleotide of i), iv) a polynucleotide
complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Alternatively, the
target polynucleotide comprises a fragment of a polynucleotide sequence selected from the group
consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) comparing the
amount of hybridization complex in the treated biological sample with the amount of hybridization
complex in an untreated biological sample, wherein a difference in the amount of hybridization
complex in the treated biological sample is indicative of toxicity of the test compound.



BRIEF DESCRIPTION OF THE TABLES 
Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide
sequences of the present invention.

Table 2 shows the GenBank identification number and annotation of the nearest GenBank
homolog for polypeptides of the invention. The probability scores for the matches between each
polypeptide and its homolog(s) are also shown.

Table 3 shows structural features of polypeptide sequences of the invention, including
predicted motifs and domains, along with the methods, algorithms, and searchable databases used
for analysis of the polypeptides.

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble
polynucleotide sequences of the invention, along with selected fragments of the polynucleotide
sequences.

Table 5 shows the representative cDNA library for polynucleotides of the invention.

Table 6 provides an appendix which describes the tissues and vectors used for construction
of the cDNA libraries shown in Table 5.

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and
polypeptides of the invention, along with applicable descriptions, references, and threshold
parameters.


DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is
understood that this invention is not limited to the particular machines, materials and methods
described, as these may vary. It is also to be understood that the terminology used herein is for the
purpose of describing particular embodiments only, and is not intended to limit the scope of the
present invention which will be limited only by the appended claims.

It must be noted that as used herein and in the appended claims, the singular forms "a,"
"an," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an
antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled in
the art, and so forth.

Unless defined otherwise, all technical and scientific terms used herein have the same
meanings as commonly understood by one of ordinary skill in the art to which this invention
belongs. Although any machines, materials, and methods similar or equivalent to those described
herein can be used to practice or test the present invention, the preferred machines, materials and
methods are now described. All publications mentioned herein are cited for the purpose of
describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the
publications and which might be used in connection with the invention. Nothing herein is to be
construed as an admission that the invention is not entitled to antedate such disclosure by virtue of
prior invention.

DEFINITIONS

"TRICH" refers to the amino acid sequences of substantially purified TRICH obtained from
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine,
and human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant.

The term "agonist" refers to a molecule which intensifies or mimics the biological activity
of TRICH. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any
other compound or composition which modulates the activity of TRICH either by directly
interacting with TRICH or by acting on components of the biological pathway in which TRICH
participates.

An "allelic variant" is an alternative form of the gene encoding TRICH. Allelic variants
may result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs
or in polypeptides whose structure or function may or may not be altered. A gene may have none,
one, or many allelic variants of its naturally occurring form. Common mutational changes which
give rise to allelic variants are generally ascribed to natural deletions, additions, or substitutions of
nucleotides. Each of these types of changes may occur alone, or in combination with the others, one
or more times in a given sequence.

"Altered" nucleic acid sequences encoding TRICH include those sequences with deletions,
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TRICH or
a polypeptide with at least one functional characteristic of TRICH. Included within this definition
are polymorphisms which may or may not be readily detectable using a particular oligonucleotide
probe of the polynucleotide encoding TRICH, and improper or unexpected hybridization to allelic
variants, with a locus other than the normal chromosomal locus for the polynucleotide sequence
encoding TRICH. The encoded protein may also be "altered," and may contain deletions,
insertions, or substitutions of amino acid residues which produce a silent change and result in a
functionally equivalent TRICH. Deliberate amino acid substitutions may be made on the basis of
similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic
nature of the residues, as long as the biological or immunological activity of TRICH is retained. For
example, negatively charged amino acids may include aspartic acid and glutamic acid, and
positively charged amino acids may include lysine and arginine. Amino acids with uncharged polar
side chains having similar hydrophilicity values may include: asparagine and glutamine; and serine
and threonine. Amino acids with uncharged side chains having similar hydrophilicity values may
include: leucine, isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine.

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide,
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or
synthetic molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino
acid sequence to the complete native amino acid sequence associated with the recited protein
molecule.

"Amplification" relates to the production of additional copies of a nucleic acid sequence.
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well
known in the art.

The term "antagonist" refers to a molecule which inhibits or attenuates the biological
activity of TRICH. Antagonists may include proteins such as antibodies, nucleic acids,
carbohydrates, small molecules, or any other compound or composition which modulates the
activity of TRICH either by directly interacting with TRICH or by acting on components of the
biological pathway in which TRICH participates.

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments
thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic
determinant. Antibodies that bind TRICH polypeptides can be prepared using intact polypeptides or
using fragments containing small peptides of interest as the immunizing antigen. The polypeptide
or oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from
the translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if
desired. Commonly used carriers that are chemically coupled to peptides include bovine serum
albumin, thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used
to immunize the animal.

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that
makes contact with a particular antibody. When a protein or a fragment of a protein is used to
immunize a host animal, numerous regions of the protein may induce the production of antibodies
which bind specifically to antigenic determinants (particular regions or three-dimensional structures
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen
used to elicit the immune response) for binding to an antibody.

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a
specific molecular target. Aptamers are derived from an in vitro evolutionary process (e.g., SELEX
(Systematic Evolution of Ligands by Exponential Enrichment), described in U.S. Patent No.
5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries.
Aptamer compositions may be double-stranded or single-stranded, and may include
deoxyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules.
The nucleotide components of an aptamer may have modified sugar groups (e.g., the 2'-OH group of
a ribonucleotide may be replaced by 2'-F or 2'-NH2), which may improve a desired property, e.g.,
resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules,
e.g., a high molecular weight carrier to slow clearance of the aptamer from the circulatory system.
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a
cross-linker. (See, e.g., Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13.)

The term "intramer" refers to an aptamer which is expressed in vivo. For example, a
vaccinia virus-based RNA expression system has been used to express specific RNA aptamers at
high levels in the cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Natl. Acad. Sci. USA
96:3606-3610).

The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left-
handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed
nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on
substrates containing right-handed nucleotides.

The term "antisense" refers to any composition capable of base-pairing with the "sense"
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA;
RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified
sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having
modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense
molecules may be produced by any method including chemical synthesis or transcription. Once
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring
nucleic acid sequence produced by the cell to form duplexes which block either transcription or
translation. The designation "negative" or "minus" can refer to the antisense strand, and the
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule.

The term "biologically active" refers to a protein having structural, regulatory, or
biochemical functions of a naturally occurring molecule. Likewise, "immunologically active" or
"immunogenic" refers to the capability of the natural, recombinant, or synthetic TRICH, or of any
oligopeptide thereof, to induce a specific immune response in appropriate animals or cells and to
bind with specific antibodies.

"Complementary" describes the relationship between two single-stranded nucleic acid
sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement,
3'-TCA-5'.
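
The pairing rule in the example above can be illustrated with a minimal Python sketch. The function names and the restriction to the four unmodified DNA bases are assumptions made here for illustration; they do not come from the specification.

```python
# Watson-Crick pairing for the four unmodified DNA bases (illustrative only).
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement_3_to_5(strand_5_to_3: str) -> str:
    """Return the complementary strand, base by base, written 3'->5'."""
    return "".join(PAIRS[base] for base in strand_5_to_3.upper())


def reverse_complement(strand_5_to_3: str) -> str:
    """Return the same complementary strand rewritten in the 5'->3' direction."""
    return complement_3_to_5(strand_5_to_3)[::-1]


print(complement_3_to_5("AGT"))   # TCA, i.e. 3'-TCA-5'
print(reverse_complement("AGT"))  # ACT, i.e. 5'-ACT-3'
```

Both outputs describe the same molecule; only the direction in which the strand is written differs.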

A "composition comprising a given polynucleotide sequence" and a "composition
comprising a given amino acid sequence" refer broadly to any composition containing the given
polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an
aqueous solution. Compositions comprising polynucleotide sequences encoding TRICH or
fragments of TRICH may be employed as hybridization probes. The probes may be stored in
freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. In
hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., NaCl),
detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's solution,
dry milk, salmon sperm DNA, etc.).

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to
repeated DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit
(Applied Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which
has been assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a
computer program for fragment assembly, such as the GELVIEW fragment assembly system (GCG,
Madison WI) or Phrap (University of Washington, Seattle WA). Some sequences have been both
extended and assembled to produce the consensus sequence.

"Conservative amino acid substitutions" are those substitutions that are predicted to least
interfere with the properties of the original protein, i.e., the structure and especially the function of
the protein is conserved and not significantly changed by such substitutions. The table below shows
amino acids which may be substituted for an original amino acid in a protein and which are
regarded as conservative amino acid substitutions.





Original Residue     Conservative Substitution
Ala                  Gly, Ser
Arg                  His, Lys
Asn                  Asp, Gln, His
Asp                  Asn, Glu
Cys                  Ala, Ser
Gln                  Asn, Glu, His
Glu                  Asp, Gln, His
Gly                  Ala
His                  Asn, Arg, Gln, Glu
Ile                  Leu, Val
Leu                  Ile, Val
Lys                  Arg, Gln, Glu
Met                  Leu, Ile
Phe                  His, Met, Leu, Trp, Tyr
Ser                  Cys, Thr
Thr                  Ser, Val
Trp                  Phe, Tyr
Tyr                  His, Phe, Trp
Val                  Ile, Leu, Thr



Conservative amino acid substitutions generally maintain (a) the structure of the
polypeptide backbone in the area of the substitution, for example, as a beta sheet or alpha helical
conformation, (b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or
(c) the bulk of the side chain.
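
As an illustration only, the table of conservative substitutions can be held as a lookup structure. The sketch below is in Python; the names CONSERVATIVE and is_conservative are hypothetical, and the entries are transcribed from the table as reconstructed from the source.

```python
# Conservative substitutions keyed by original residue (three-letter codes).
CONSERVATIVE = {
    "Ala": {"Gly", "Ser"},
    "Arg": {"His", "Lys"},
    "Asn": {"Asp", "Gln", "His"},
    "Asp": {"Asn", "Glu"},
    "Cys": {"Ala", "Ser"},
    "Gln": {"Asn", "Glu", "His"},
    "Glu": {"Asp", "Gln", "His"},
    "Gly": {"Ala"},
    "His": {"Asn", "Arg", "Gln", "Glu"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"His", "Met", "Leu", "Trp", "Tyr"},
    "Ser": {"Cys", "Thr"},
    "Thr": {"Ser", "Val"},
    "Trp": {"Phe", "Tyr"},
    "Tyr": {"His", "Phe", "Trp"},
    "Val": {"Ile", "Leu", "Thr"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if the table lists `replacement` as conservative for `original`."""
    return replacement in CONSERVATIVE.get(original, set())


print(is_conservative("Ala", "Gly"))  # listed in the table
print(is_conservative("Cys", "Ala"))  # listed in the table
print(is_conservative("Ala", "Cys"))  # not listed: the table is directional
```

The lookup is deliberately one-directional, because the table itself is not symmetric (Cys may be replaced by Ala, but Ala is not listed as replaceable by Cys).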

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the
absence of one or more amino acid residues or nucleotides.

The term "derivative" refers to a chemically modified polynucleotide or polypeptide.
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by
an alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which
retains at least one biological or immunological function of the natural molecule. A derivative
polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least
one biological or immunological function of the polypeptide from which it was derived.

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide.

"Differential expression" refers to increased or upregulated; or decreased, downregulated,
or absent gene or protein expression, determined by comparing at least two different samples. Such
comparisons may be carried out between, for example, a treated and an untreated sample, or a
diseased and a normal sample.

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an
exon may represent a structural or functional domain of the encoded protein, new proteins may be
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the
evolution of new protein functions.

A "fragment" is a unique portion of TRICH or the polynucleotide encoding TRICH which is
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise
up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For
example, a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues.
A fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at
least 5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or
amino acid residues in length. Fragments may be preferentially selected from certain regions of a
molecule. For example, a polypeptide fragment may comprise a certain length of contiguous amino
acids selected from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as
shown in a certain defined sequence. Clearly these lengths are exemplary, and any length that is
supported by the specification, including the Sequence Listing, tables, and figures, may be
encompassed by the present embodiments.

A fragment of SEQ ID NO:33-64 comprises a region of unique polynucleotide sequence
that specifically identifies SEQ ID NO:33-64, for example, as distinct from any other sequence in
the genome from which the fragment was obtained. A fragment of SEQ ID NO:33-64 is useful, for
example, in hybridization and amplification technologies and in analogous methods that distinguish
SEQ ID NO:33-64 from related polynucleotide sequences. The precise length of a fragment of SEQ
ID NO:33-64 and the region of SEQ ID NO:33-64 to which the fragment corresponds are routinely
determinable by one of ordinary skill in the art based on the intended purpose for the fragment.

A fragment of SEQ ID NO:1-32 is encoded by a fragment of SEQ ID NO:33-64. A
fragment of SEQ ID NO:1-32 comprises a region of unique amino acid sequence that specifically
identifies SEQ ID NO:1-32. For example, a fragment of SEQ ID NO:1-32 is useful as an
immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO:1-32.
The precise length of a fragment of SEQ ID NO:1-32 and the region of SEQ ID NO:1-32 to
which the fragment corresponds are routinely determinable by one of ordinary skill in the art based
on the intended purpose for the fragment.

A "full length" polynucleotide sequence is one containing at least a translation initiation
codon (e.g., methionine) followed by an open reading frame and a translation termination codon. A
"full length" polynucleotide sequence encodes a "full length" polypeptide sequence.

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between
two or more polynucleotide sequences or two or more polypeptide sequences.

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer
to the percentage of residue matches between at least two polynucleotide sequences aligned using a
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps
in the sequences being compared in order to optimize alignment between two sequences, and
therefore achieve a more meaningful comparison of the two sequences.
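
The notion of counting residue matches over an alignment can be sketched as follows in Python. This is not the CLUSTAL V or BLAST calculation: the function name, the use of "-" as the gap character, and the choice to divide by the total number of alignment columns (gap columns included) are all assumptions made for illustration, and other denominators are in common use.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Both inputs must already be aligned (equal length, gaps inserted).
    Assumption: the denominator is the number of alignment columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != gap
    )
    return 100.0 * matches / len(aligned_a)


# Four of the six columns match (the gap column and one mismatch do not).
print(round(percent_identity("ACGT-A", "ACCTTA"), 1))
```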

Percent identity between polynucleotide sequences may be determined using the default
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e
sequence alignment program. This program is part of the LASERGENE software package, a suite of
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in
Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992)
CABIOS 8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters
are set as follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted"
residue weight table is selected as the default. Percent identity is reported by CLUSTAL V as the
"percent similarity" between aligned polynucleotide sequences.

Alternatively, a suite of commonly used and freely available sequence comparison
algorithms is provided by the National Center for Biotechnology Information (NCBI) Basic Local
Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is
available from several sources, including the NCBI, Bethesda, MD, and on the Internet at
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence
analysis programs including "blastn," that is used to align a known polynucleotide sequence with
other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST
2 Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html.
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST
programs are commonly used with gap and other parameters set to default settings. For example, to
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool
Version 2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for
example:

    Matrix: BLOSUM62
    Reward for match: 1
    Penalty for mismatch: -2
    Open Gap: 5 and Extension Gap: 2 penalties
    Gap x drop-off: 50
    Expect: 10
    Word Size: 11
    Filter: on

Percent identity may be measured over the length of an entire defined sequence, for
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for
example, over the length of a fragment taken from a larger, defined sequence, for instance, a
fragment of at least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200
contiguous nucleotides. Such lengths are exemplary only, and it is understood that any fragment
length supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be
used to describe a length over which percentage identity may be measured.

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that
changes in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic
acid sequences that all encode substantially the same protein.
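
The effect of codon degeneracy can be illustrated with a short Python sketch. The two nucleotide sequences below are invented for this example and are not taken from the Sequence Listing; the codon table is the standard genetic code, and the helper names are hypothetical.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence in frame 1 (one-letter codes)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def count_matches(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences agree."""
    return sum(x == y for x, y in zip(a, b))


seq_a = "ATGCTTAGCCGT"  # invented example sequence
seq_b = "ATGTTATCAAGA"  # invented example using synonymous codons

print(translate(seq_a), translate(seq_b))
print(count_matches(seq_a, seq_b), "of", len(seq_a), "nucleotide positions identical")
```

Both sequences translate to the same four-residue peptide although fewer than half of their nucleotide positions agree.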

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer
to the percentage of residue matches between at least two polypeptide sequences aligned using a
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some
alignment methods take into account conservative amino acid substitutions. Such conservative
substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at
the site of substitution, thus preserving the structure (and therefore function) of the polypeptide.

Percent identity between polypeptide sequences may be determined using the default
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e
sequence alignment program (described and referenced above). For pairwise alignments of
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default
residue weight table. As with polynucleotide alignments, the percent identity is reported by
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs.

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version
2.0.12 (April-21-2000) with blastp set at default parameters. Such default parameters may be, for
example:

    Matrix: BLOSUM62
    Open Gap: 11 and Extension Gap: 1 penalties
    Gap x drop-off: 50
    Expect: 10
    Word Size: 3
    Filter: on

Percent identity may be measured over the length of an entire defined polypeptide sequence,
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length,
for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at
least 150 contiguous residues. Such lengths are exemplary only, and it is understood that any
fragment length supported by the sequences shown herein, in the tables, figures or Sequence
Listing, may be used to describe a length over which percentage identity may be measured.

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for
chromosome replication, segregation and maintenance.

The term "humanized antibody" refers to an antibody molecule in which the amino acid
sequence in the non-antigen binding regions has been altered so that the antibody more closely
resembles a human antibody, and still retains its original binding ability.

"Hybridization" refers to the process by which a polynucleotide strand anneals with a
complementary strand through base pairing under defined hybridization conditions. Specific
hybridization is an indication that two nucleic acid sequences share a high degree of
complementarity. Specific hybridization complexes form under permissive annealing conditions
and remain hybridized after the "washing" step(s). The washing step(s) is particularly important in
determining the stringency of the hybridization process, with more stringent conditions allowing
less non-specific binding, i.e., binding between pairs of nucleic acid strands that are not perfectly
matched. Permissive conditions for annealing of nucleic acid sequences are routinely determinable
by one of ordinary skill in the art and may be consistent among hybridization experiments, whereas
wash conditions may be varied among experiments to achieve the desired stringency, and therefore
hybridization specificity. Permissive annealing conditions occur, for example, at 68°C in the
presence of about 6 x SSC, about 1% (w/v) SDS, and about 100 µg/ml sheared, denatured salmon
sperm DNA.

Generally, stringency of hybridization is expressed, in part, with reference to the
temperature under which the wash step is carried out. Such wash temperatures are typically
selected to be about 5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence
at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH)
at which 50% of the target sequence hybridizes to a perfectly matched probe. An equation for
calculating Tm and conditions for nucleic acid hybridization are well known and can be found in
Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring
Harbor Press, Plainview NY; specifically see volume 2, chapter 9.
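
The relation between Tm and wash temperature can be sketched in Python as follows. The text does not reproduce the equation it refers to, so the Tm estimate below uses one commonly quoted empirical approximation for long DNA duplexes; that choice, its coefficients (which vary between references), and both function names are assumptions made for illustration, not the equation of the cited manual. Only the 5°C to 20°C offset comes from the text.

```python
import math


def estimate_tm(na_molar: float, gc_percent: float, length_bp: int,
                formamide_percent: float = 0.0) -> float:
    """Rough Tm estimate in degrees C for a long DNA:DNA duplex.

    Assumed approximation (not taken from the specification):
    Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 0.63*(%formamide) - 600/length
    """
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.63 * formamide_percent
            - 600.0 / length_bp)


def wash_window(tm_celsius: float, low_offset: float = 5.0,
                high_offset: float = 20.0) -> tuple:
    """Wash temperature range lying about 5 to 20 degrees C below the Tm."""
    return (tm_celsius - high_offset, tm_celsius - low_offset)


# Hypothetical probe: 600 bp, 50% GC, washed at roughly 0.04 M sodium ion.
tm = estimate_tm(na_molar=0.04, gc_percent=50.0, length_bp=600)
low, high = wash_window(tm)
print(f"estimated Tm {tm:.1f} C, wash between {low:.1f} C and {high:.1f} C")
```

Lowering the salt concentration or adding formamide lowers the estimated Tm, which is why the same wash temperature becomes more stringent under those conditions.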

High stringency conditions for hybridization between polynucleotides of the present
invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS,
for 1 hour. Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC
concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%.
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 µg/ml. Organic
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under
particular circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash
conditions will be readily apparent to those of ordinary skill in the art. Hybridization, particularly
under high stringency conditions, may be suggestive of evolutionary similarity between the
nucleotides. Such similarity is strongly indicative of a similar role for the nucleotides and their
encoded polypeptides.

The term "hybridization complex" refers to a complex formed between two nucleic acid
sequences by virtue of the formation of hydrogen bonds between complementary bases. A
hybridization complex may be formed in solution (e.g., C0t or R0t analysis) or formed between one
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate
substrate to which cells or their nucleic acids have been fixed).

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively.

"Immune response" can refer to conditions associated with inflammation, trauma, immune
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect
cellular and systemic defense systems.

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TRICH which is
capable of eliciting an immune response when introduced into a living organism, for example, a
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide
fragment of TRICH which is useful in any of the antibody production methods disclosed herein or
known in the art.

The term "microarray" refers to an arrangement of a plurality of polynucleotides,
polypeptides, or other chemical compounds on a substrate.

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other
chemical compound having a unique and defined position on a microarray.

The term "modulate" refers to a change in the activity of TRICH. For example, modulation
may cause an increase or a decrease in protein activity, binding characteristics, or any other
biological, functional, or immunological properties of TRICH.

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide,
oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA
of genomic or synthetic origin which may be single-stranded or double-stranded and may represent
the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like
material.

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably
linked to a coding sequence if the promoter affects the transcription or expression of the coding
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where
necessary to join two protein coding regions, in the same reading frame.

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone
of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition.
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript
elongation, and may be pegylated to extend their lifespan in the cell.

"Post-translational modification" of a TRICH may involve lipidation, glycosylation,
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in
the art. These processes may occur synthetically or biochemically. Biochemical modifications will
vary by cell type depending on the enzymatic milieu of TRICH.

"Probe" refers to nucleic acid sequences encoding TRICH, their complements, or fragments
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule.
Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes.
"Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target
polynucleotide by complementary base-pairing. The primer may then be extended along the target
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR).

Probes and primers as used in the present invention typically comprise at least 15
contiguous nucleotides of a known sequence. In order to enhance specificity, longer probes and
primers may also be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50,
60, 70, 80, 90, 100, or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences.
Probes and primers may be considerably longer than these examples, and it is understood that any
length supported by the specification, including the tables, figures, and Sequence Listing, may be
used.

Methods for preparing and using probes and primers are described in the references, for
example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular
Biology, Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR
Protocols, A Guide to Methods and Applications, Academic Press, San Diego CA. PCR primer
pairs can be derived from a known sequence, for example, by using computer programs intended for
that purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research,
Cambridge MA).

Oligonucleotides for use as primers are selected using software known in the art for such
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer
selection programs have incorporated additional features for expanded capabilities. For example,
the PrimOU primer selection program (available to the public from the Genome Center at University
of Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from
megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3
primer selection program (available to the public from the Whitehead Institute/MIT Center for
Genome Research, Cambridge MA) allows the user to input a "mispriming library," in which
sequences to avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for
the selection of oligonucleotides for microarrays. (The source code for the latter two primer
selection programs may also be obtained from their respective sources and modified to meet the
user's specific needs.) The PrimeGen program (available to the public from the UK Human
Genome Mapping Project Resource Centre, Cambridge UK) designs primers based on multiple
sequence alignments, thereby allowing selection of primers that hybridize to either the most
conserved or least conserved regions of aligned nucleic acid sequences. Hence, this program is
useful for identification of both unique and conserved oligonucleotides and polynucleotide
fragments. The oligonucleotides and polynucleotide fragments identified by any of the above
selection methods are useful in hybridization technologies, for example, as PCR or sequencing
primers, microarray elements, or specific probes to identify fully or partially complementary
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited
to those described above.

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence
that is made by an artificial combination of two or more otherwise separated segments of sequence.
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques
such as those described in Sambrook, supra. The term recombinant includes nucleic acids that have
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently,
a recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to
transform a cell.

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a
vaccinia virus, that could be used to vaccinate a mammal wherein the recombinant nucleic acid is
expressed, inducing a protective immunological response in the mammal.

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions
(UTRs). Regulatory elements interact with host or viral proteins which control transcription,
translation, or RNA stability.

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid,
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent,
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and
other moieties known in the art.

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of
the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of
ribose instead of deoxyribose.
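
The base substitution in this definition can be illustrated with a minimal sketch. The function name rna_equivalent and the restriction to the letters A, C, G, T, and N are assumptions made for illustration only; the change of sugar backbone is a chemical difference that a text representation cannot capture.

```python
def rna_equivalent(dna: str) -> str:
    """Return the RNA equivalent of a DNA sequence written 5' to 3'.

    Hypothetical helper: the name and the accepted alphabet (A, C, G, T, N)
    are illustrative assumptions, not part of the specification.
    """
    seq = dna.upper()
    unexpected = set(seq) - set("ACGTN")
    if unexpected:
        raise ValueError(f"unexpected characters in DNA sequence: {sorted(unexpected)}")
    # The linear order of nucleotides is preserved; only thymine (T)
    # is rewritten as uracil (U).
    return seq.replace("T", "U")


if __name__ == "__main__":
    print(rna_equivalent("ATGTTC"))
```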

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH,
nucleic acids encoding TRICH, or fragments thereof may comprise a bodily fluid; an extract from a
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc.

The terms "specific binding" and "specifically binding" refer to that interaction between a
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or
synthetic binding composition. The interaction is dependent upon the presence of a particular
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled
A and the antibody will reduce the amount of labeled A that binds to the antibody.

The term "substantially purified" refers to nucleic acid or amino acid sequences that are
removed from their natural environment and are isolated or separated, and are at least 60% free,
preferably at least 75% free, and most preferably at least 90% free from other components with
which they are naturally associated.

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides
by different amino acid residues or nucleotides, respectively.

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters,
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers,
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells,
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound.

A "transcript image" or "expression profile" refers to the collective pattern of gene
expression by a particular cell type or tissue under given conditions at a given time.

"Transformation" describes a process by which exogenous DNA is introduced into a
recipient cell. Transformation may occur under natural or artificial conditions according to various
methods well known in the art, and may rely on any known method for the insertion of foreign
nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for transformation is
selected based on the type of host cell being transformed and may include, but is not limited to,
bacteriophage or viral infection, electroporation, heat shock, lipofection, and particle bombardment.
The term "transformed cells" includes stably transformed cells in which the inserted DNA is
capable of replication either as an autonomously replicating plasmid or as part of the host
chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for
limited periods of time.

A "transgenic organism," as used herein, is any organism, including but not limited to
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic
acid introduced by way of human intervention, such as by transgenic techniques well known in the
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a
precursor of the cell, by way of deliberate genetic manipulation, such as by microinjection or by
infection with a recombinant virus. The term genetic manipulation does not include classical
cross-breeding, or in vitro fertilization, but rather is directed to the introduction of a recombinant
DNA molecule. The transgenic organisms contemplated in accordance with the present invention
include bacteria, cyanobacteria, fungi, plants and animals. The isolated DNA of the present
invention can be introduced into the host by methods known in the art, for example infection,
transfection, transformation or transconjugation. Techniques for transferring the DNA of the
present invention into such organisms are widely known and provided in references such as
Sambrook et al. (1989), supra.

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence
having at least 40% sequence identity to the particular nucleic acid sequence over a certain length of
one of the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9
(May-07-1999) set at default parameters. Such a pair of nucleic acids may show, for example, at
least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, or at least 98%, or greater
sequence identity over a certain defined length. A variant may be described as, for example, an
"allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may
have significant identity to a reference molecule, but will generally have a greater or lesser number
of polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding
polypeptide may possess additional functional domains or lack domains that are present in the
reference molecule. Species variants are polynucleotide sequences that vary from one species to
another. The resulting polypeptides will generally have significant amino acid identity relative to
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene
between individuals of a given species. Polymorphic variants also may encompass "single
nucleotide polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide
base. The presence of SNPs may be indicative of, for example, a certain population, a disease state,
or a propensity for a disease state.

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence
having at least 40% sequence identity to the particular polypeptide sequence over a certain length of
one of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9
(May-07-1999) set at default parameters. Such a pair of polypeptides may show, for example, at
least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least
93%, at least 94%, at least 95%, at least 96%, at least 97%, or at least 98%, or greater sequence
identity over a certain defined length of one of the polypeptides.
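
The percent-identity thresholds in the two "variant" definitions can be illustrated with a simplified sketch. It assumes the two sequences have already been aligned (for example, by a BLAST 2 Sequences run) and are supplied as equal-length strings with "-" marking gaps; it does not reproduce the BLAST alignment itself. The function names, the gap convention, and the example peptides are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the aligned length of two pre-aligned sequences.

    Assumption: columns in which both sequences carry a gap are ignored;
    a gap opposite a residue counts as a compared, non-identical position.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def meets_variant_threshold(aligned_a: str, aligned_b: str, threshold: float = 40.0) -> bool:
    """True if identity is at least the threshold (40% in the definition above)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical aligned peptides differing at one of eight positions.
    print(percent_identity("MKTAYIAK", "MKTAHIAK"))
```

The same function applies to nucleic acid or polypeptide alignments; only the threshold passed to meets_variant_threshold changes (e.g., 70%, 85%, or 95%).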

THE INVENTION

The invention is based on the discovery of new human transporters and ion channels
(TRICH), the polynucleotides encoding TRICH, and the use of these compositions for the diagnosis,
treatment, or prevention of transport, neurological, muscle, immunological and cell proliferative
disorders.

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide
sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to
a single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an
Incyte polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide
sequence is denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ
ID NO:) and an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as
shown.

Table 2 shows sequences with homology to the polypeptides of the invention as identified
by BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the
polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding
Incyte polypeptide sequence number (Incyte Polypeptide ID) for polypeptides of the invention.
Column 3 shows the GenBank identification number (GenBank ID NO:) of the nearest GenBank
homolog. Column 4 shows the probability scores for the matches between each polypeptide and its
homolog(s). Column 5 shows the annotation of the GenBank homolog(s) along with relevant
citations where applicable, all of which are expressly incorporated by reference herein.

Table 3 shows various structural features of the polypeptides of the invention. Columns 1
and 2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding
Incyte polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention.
Column 3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the
MOTIFS program of the GCG sequence analysis software package (Genetics Computer Group,
Madison WI). Column 6 shows amino acid residues comprising signature sequences, domains, and
motifs. Column 7 shows analytical methods for protein structure/function analysis and in some
cases, searchable databases to which the analytical methods were applied.

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and
these properties establish that the claimed polypeptides are transporters and ion channels. For
example, SEQ ID NO:1 is 83% identical to human sodium-hydrogen exchanger 6 (GenBank ID
g2944233) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The
BLAST probability score is 5.1e-242, which indicates the probability of obtaining the observed
polypeptide sequence alignment by chance. SEQ ID NO:1 also contains a sodium/hydrogen
exchanger family domain as determined by searching for statistically significant matches in the
hidden Markov model (HMM)-based PFAM database of conserved protein family domains. (See
Table 3.) Data from BLIMPS and other BLAST analyses provide further corroborative evidence
that SEQ ID NO:1 is a sodium-hydrogen exchange transporter. In another example, SEQ ID NO:7
is 85% identical to Rattus norvegicus Na+/K+-ATPase alpha subunit (GenBank ID g619915) as
determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST
probability score is 0.0, which indicates the probability of obtaining the observed polypeptide
sequence alignment by chance. SEQ ID NO:7 also contains an E1-E2 ATPase domain as
determined by searching for statistically significant matches in the hidden Markov model (HMM)-
based PFAM database of conserved protein family domains. (See Table 3.) Data from BLIMPS,
MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO:7
is a cation-transporting ATPase. In yet another example, SEQ ID NO:13 is 77% identical to a
human carrier-like protein (GenBank ID g3694661) as determined by the Basic Local Alignment
Search Tool (BLAST). (See Table 2.) The BLAST probability score is 5.5e-209, which indicates
the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID
NO:13 also contains a mitochondrial energy transfer protein domain as determined by searching for
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of
conserved protein family domains. (See Table 3.) The presence of this domain is confirmed by
BLIMPS, MOTIFS, and PROFILESCAN analyses, providing further corroborative evidence that
SEQ ID NO:13 is a transporter. Further, SEQ ID NO:16 is 41% identical to human novel ATPase
(GenBank ID g8979801) as determined by the Basic Local Alignment Search Tool (BLAST). (See
Table 2.) The BLAST probability score is 1.1e-165, which indicates the probability of obtaining the
observed polypeptide sequence alignment by chance. SEQ ID NO:16 also contains an E1-E2
ATPase domain as determined by searching for statistically significant matches in the hidden
Markov model (HMM)-based PFAM database of conserved protein family domains. (See Table 3.)
Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence
that SEQ ID NO:16 is a cation-transporting ATPase. In a further example, SEQ ID NO:19 is 43%
identical to Sinorhizobium sp. As4 ArsA, the catalytic subunit of the arsenic oxyanion-translocating
ATPase (GenBank ID g5802945) as determined by the Basic Local Alignment Search Tool
(BLAST). (See Table 2.) The BLAST probability score is 7.7e-125, which indicates the probability
of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO:19 also contains
an anion-transporting ATPase domain as determined by searching for statistically significant
matches in the hidden Markov model (HMM)-based PFAM database of conserved protein family
domains. (See Table 3.) Data from additional BLAST and PROFILESCAN analyses provide
further corroborative evidence that SEQ ID NO:19 is an anion-transporting ATPase. In yet a further
example, SEQ ID NO:21 is 54% identical to a murine putative E1-E2 ATPase (GenBank ID
g28577) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The
BLAST probability score is 5.2e-190, which indicates the probability of obtaining the observed
polypeptide sequence alignment by chance. SEQ ID NO:21 also contains six transmembrane
domains as determined using TMAP, a program which delineates transmembrane segments. (See
Table 3.) Data from BLIMPS and MOTIFS analyses provide further corroborative evidence that
SEQ ID NO:21 is an ATPase. In a further example, SEQ ID NO:24 is 52% identical, from residue
A77 to residue L1007, to rat NMDAR-L (GenBank ID g2160125) as determined by the Basic Local
Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 1.5e-262, which
indicates the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ
ID NO:24 also contains a ligand gated ion channel domain as determined by searching for
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of
conserved protein family domains. (See Table 3.) Data from BLIMPS analysis provide further
corroborative evidence that SEQ ID NO:24 is a glutamate receptor. SEQ ID NO:2-6, SEQ ID NO:8-
12, SEQ ID NO:14-15, SEQ ID NO:17-18, SEQ ID NO:20, SEQ ID NO:22-23, and SEQ ID NO:25-
32 were analyzed and annotated in a similar manner. The algorithms and parameters for the
analysis of SEQ ID NO:1-32 are described in Table 7.

As shown in Table 4, the full length polynucleotide sequences of the present invention were
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any
combination of these two types of sequences. Column 1 lists the polynucleotide sequence
identification number (Polynucleotide SEQ ID NO:), the corresponding Incyte polynucleotide
consensus sequence number (Incyte ID) for each polynucleotide of the invention, and the length of
each polynucleotide sequence in basepairs. Column 2 shows the nucleotide start (5') and stop (3')
positions of the cDNA and/or genomic sequences used to assemble the full length polynucleotide
sequences of the invention, and of fragments of the polynucleotide sequences which are useful, for
example, in hybridization or amplification technologies that identify SEQ ID NO:33-64 or that
distinguish between SEQ ID NO:33-64 and related polynucleotide sequences.

The polynucleotide fragments described in Column 2 of Table 4 may refer specifically, for
example, to Incyte cDNAs derived from tissue-specific cDNA libraries or from pooled cDNA
libraries. Alternatively, the polynucleotide fragments described in column 2 may refer to GenBank
cDNAs or ESTs which contributed to the assembly of the full length polynucleotide sequences. In
addition, the polynucleotide fragments described in column 2 may identify sequences derived from
the ENSEMBL (The Sanger Centre, Cambridge, UK) database (i.e., those sequences including the
designation "ENST"). Alternatively, the polynucleotide fragments described in column 2 may be
derived from the NCBI RefSeq Nucleotide Sequence Records Database (i.e., those sequences
including the designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i.e.,
those sequences including the designation "NP"). Alternatively, the polynucleotide fragments
described in column 2 may refer to assemblages of both cDNA and Genscan-predicted exons
brought together by an "exon stitching" algorithm. For example, a polynucleotide sequence
identified as FL_XXXXXX_N1_N2_YYYYY_N3_N4 represents a "stitched" sequence in which
XXXXXX is the identification number of the cluster of sequences to which the algorithm was
applied, and YYYYY is the number of the prediction generated by the algorithm, and N1,2,3..., if
present, represent specific exons that may have been manually edited during analysis (See Example
V). Alternatively, the polynucleotide fragments in column 2 may refer to assemblages of exons
brought together by an "exon-stretching" algorithm. For example, a polynucleotide sequence
identified as FLXXXXXX_gAAAAA_gBBBBB_1_N is a "stretched" sequence, with XXXXXX being
the Incyte project identification number, gAAAAA being the GenBank identification number of the
human genomic sequence to which the "exon-stretching" algorithm was applied, gBBBBB being the
GenBank identification number or NCBI RefSeq identification number of the nearest GenBank
protein homolog, and N referring to specific exons (See Example V). In instances where a RefSeq
sequence was used as a protein homolog for the "exon-stretching" algorithm, a RefSeq identifier
(denoted by "NM," "NP," or "NT") may be used in place of the GenBank identifier (i.e., gBBBBB).

Alternatively, a prefix identifies component sequences that were hand-edited, predicted
from genomic DNA sequences, or derived from a combination of sequence analysis methods. The
following Table lists examples of component sequence prefixes and corresponding sequence
analysis methods associated with the prefixes (see Example IV and Example V).



Prefix           Type of analysis and/or examples of programs

GNN, GPG, ENST   Exon prediction from genomic sequences using, for example,
                 GENSCAN (Stanford University, CA, USA) or FGENES
                 (Computer Genomics Group, The Sanger Centre, Cambridge, UK).

GBI              Hand-edited analysis of genomic sequences.

FL               Stitched or stretched genomic sequences (see Example V).

INCY             Full length transcript and exon prediction from mapping of EST
                 sequences to the genome. Genomic location and EST composition
                 data are combined to predict the exons and resulting transcript.
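
The prefix table above is in effect a lookup from identifier prefix to analysis method, which can be sketched as follows. The dictionary mirrors the table as printed; the function name analysis_for and the example identifiers are hypothetical and are not taken from Table 4.

```python
from typing import Optional

# Prefix -> analysis method, transcribed from the table above.
PREFIX_ANALYSIS = {
    "GNN": "exon prediction from genomic sequences",
    "GPG": "exon prediction from genomic sequences",
    "ENST": "exon prediction from genomic sequences",
    "GBI": "hand-edited analysis of genomic sequences",
    "FL": "stitched or stretched genomic sequences",
    "INCY": "full length transcript and exon prediction from mapping of EST sequences",
}


def analysis_for(identifier: str) -> Optional[str]:
    """Return the analysis method implied by an identifier's prefix, if any.

    Longer prefixes are tried first so that a short prefix can never
    shadow a longer one that begins with the same letters.
    """
    for prefix in sorted(PREFIX_ANALYSIS, key=len, reverse=True):
        if identifier.startswith(prefix):
            return PREFIX_ANALYSIS[prefix]
    return None


if __name__ == "__main__":
    # Hypothetical identifiers, for illustration only.
    for name in ("GBI0000001", "ENST00000001", "XYZ1"):
        print(name, "->", analysis_for(name))
```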



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in
Table 4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant
Incyte cDNA identification numbers are not shown.

Table 5 shows the representative cDNA libraries for those full length polynucleotide
sequences which were assembled using Incyte cDNA sequences. The representative cDNA library
is the Incyte cDNA library which is most frequently represented by the Incyte cDNA sequences
which were used to assemble and confirm the above polynucleotide sequences. The tissues and
vectors which were used to construct the cDNA libraries shown in Table 5 are described in Table 6.

The invention also encompasses TRICH variants. A preferred TRICH variant is one which
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid
sequence identity to the TRICH amino acid sequence, and which contains at least one functional or
structural characteristic of TRICH.

The invention also encompasses polynucleotides which encode TRICH. In a particular
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected
from the group consisting of SEQ ID NO:33-64, which encodes TRICH. The polynucleotide
sequences of SEQ ID NO:33-64, as presented in the Sequence Listing, embrace the equivalent RNA
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the
sugar backbone is composed of ribose instead of deoxyribose.

The invention also encompasses a variant of a polynucleotide sequence encoding TRICH.
In particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at
least about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide
sequence encoding TRICH. A particular aspect of the invention encompasses a variant of a
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID
NO:33-64 which has at least about 70%, or alternatively at least about 85%, or even at least about
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting
of SEQ ID NO:33-64. Any one of the polynucleotide variants described above can encode an amino
acid sequence which contains at least one functional or structural characteristic of TRICH.

In addition, or in the alternative, a polynucleotide variant of the invention is a splice variant
of a polynucleotide sequence encoding TRICH. A splice variant may have portions which have
significant sequence identity to the polynucleotide sequence encoding TRICH, but will generally
have a greater or lesser number of polynucleotides due to additions or deletions of blocks of
sequence arising from alternate splicing of exons during mRNA processing. A splice variant may
have less than about 70%, or alternatively less than about 60%, or alternatively less than about 50%
polynucleotide sequence identity to the polynucleotide sequence encoding TRICH over its entire
length; however, portions of the splice variant will have at least about 70%, or alternatively at least
about 85%, or alternatively at least about 95%, or alternatively 100% polynucleotide sequence
identity to portions of the polynucleotide sequence encoding TRICH. Any one of the splice variants
described above can encode an amino acid sequence which contains at least one functional or
structural characteristic of TRICH.

It will be appreciated by those skilled in the art that as a result of the degeneracy of the
genetic code, a multitude of polynucleotide sequences encoding TRICH, some bearing minimal
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be
produced. Thus, the invention contemplates each and every possible variation of polynucleotide
sequence that could be made by selecting combinations based on possible codon choices. These
combinations are made in accordance with the standard triplet genetic code as applied to the
polynucleotide sequence of naturally occurring TRICH, and all such variations are to be considered
as being specifically disclosed.
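
The size of the "multitude" referred to above follows directly from the standard genetic code: the number of distinct coding sequences for a polypeptide is the product of the number of synonymous codons at each residue. The following minimal sketch computes that product; the function name and the example peptide are illustrative assumptions, and the codon counts are those of the standard code (61 sense codons and 3 stop codons).

```python
from math import prod

# Number of synonymous codons per amino acid in the standard genetic code.
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}
STOP_CODONS = 3


def count_coding_sequences(peptide: str, include_stop: bool = False) -> int:
    """Count the distinct nucleotide sequences that encode the given peptide."""
    total = prod(CODONS_PER_RESIDUE[residue] for residue in peptide.upper())
    # Optionally account for the choice among the three stop codons.
    return total * STOP_CODONS if include_stop else total


if __name__ == "__main__":
    # Hypothetical tripeptide Met-Lys-Leu: 1 * 2 * 6 alternatives.
    print(count_coding_sequences("MKL"))
```

Because the count is a product over residues, it grows exponentially with polypeptide length, which is why the text treats the variations as a class rather than listing them.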

Although nucleotide sequences which encode TRICH and its variants are generally capable
of hybridizing to the nucleotide sequence of the naturally occurring TRICH under appropriately
selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding
TRICH or its derivatives possessing a substantially different codon usage, e.g., inclusion of non-
naturally occurring codons. Codons may be selected to increase the rate at which expression of the
peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with
which particular codons are utilized by the host. Other reasons for substantially altering the
nucleotide sequence encoding TRICH and its derivatives without altering the encoded amino acid
sequences include the production of RNA transcripts having more desirable properties, such as a
greater half-life, than transcripts produced from the naturally occurring sequence.
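
Altering codon usage without altering the encoded amino acid sequence can be sketched as a codon-by-codon substitution that is checked against the standard genetic code. This is a simplified illustration, not a method described in the specification: the function names are hypothetical, and the map of preferred codons passed in the example is an arbitrary placeholder rather than measured codon-usage data for any host.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    seq = dna.upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def recode(dna: str, preferred: dict) -> str:
    """Replace each codon with the host-preferred synonymous codon, if one is given.

    `preferred` maps an amino acid letter to a codon (a hypothetical,
    user-supplied table). Codons for amino acids not in the map are kept.
    """
    for aa, codon in preferred.items():
        if CODON_TABLE.get(codon.upper()) != aa:
            raise ValueError(f"{codon} does not encode {aa}")
    seq = dna.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    recoded = "".join(preferred.get(CODON_TABLE[c], c).upper() for c in codons)
    # The encoded amino acid sequence must be unchanged.
    assert translate(recoded) == translate(seq)
    return recoded


if __name__ == "__main__":
    # Placeholder preferences, for illustration only.
    print(recode("ATGTTAAAG", {"L": "CTG", "K": "AAA"}))
```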

The invention also encompasses production of DNA sequences which encode TRICH and
TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the
synthetic sequence may be inserted into any of the many available expression vectors and cell
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to
introduce mutations into a sequence encoding TRICH or any fragment thereof.

Also encompassed by the invention are polynucleotide sequences that are capable of
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID
NO:33-64 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol.
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in
"Definitions."

Methods for DNA sequencing are well known in the art and may be used to practice any of
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied
Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or
combinations of polymerases and proofreading exonucleases such as those found in the
ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence
preparation is automated with machines such as the MICROLAB 2200 liquid transfer system
(Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI
CATALYST 800 thermal cycler (Applied Biosystems). Sequencing is then carried out using either
the ABI 373 or 377 DNA sequencing system (Applied Biosystems), the MEGABACE 1000 DNA
sequencing system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The
resulting sequences are analyzed using a variety of algorithms which are well known in the art.
(See, e.g., Ausubel, F.M. (1997) Short Protocols in Molecular Biology, John Wiley & Sons, New
York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and Biotechnology, Wiley VCH, New
York NY, pp. 856-853.)

10 The nucleic acid sequences encoding TRICH may be extended udlizmg a partial nucleotide 

sequCTce and employing various PCR-based methods known in the art to detect upstream 
sequences, such as promoters and regulatoiy elements. For example, one mediod which may be 
employed, restriction-site PCR, uses universal and nested primers to arnplilEy unknown sequence 
from genomic DNA within a cloning vector. (See, e.g.» Sarkar, O. (1993) PGR Melhods Applic. 

15 2:318-322.) Another method, inverse PCR, uses primers that extend m divergent directions to 
amplify unknown sequence fiom a ciicul^ized template. The t^hte is dmved from restriction 
fragments comprising a known genoniic locus and sunounding sequences. (See, e.g«» Triglia, T. et 
al. (1988) Nucleic Acids Res. 16:8186.) A third method, capnue PCR« involves PGR arr^lification 
of DNA fragments adjacent to known sequences in human and yeast artificial chromosome DNA. 

20 (See, e.g.,I^gemrom*M.etaL (1991) PGR Methods Applic. 1:111-119.) In this method, multiple 
restriction enzyme digestions and ligations may be used to insert an ^gineered double^tranded 
sequence into a region of unknown sequence before performing PCR. Other methods which may be 
used to retrieve unknown sequences are known in the art (See, e.g., Parfcer, JD. et al. (1991) 
Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR. nested primers, and 

25 PROMOTERHNDER libraries (Qontech, Palo Alto CA) to walk genomic DNA. This procedure 
avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based
methods, primers may be designed using commercially available software, such as OLIGO 4.06
primer analysis software (National Biosciences, Plymouth MN) or another appropriate program, to
be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal
to the template at temperatures of about 68°C to 72°C.
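
The primer design criteria above can be illustrated with a short Python sketch. It is not the OLIGO 4.06 algorithm; the function names are hypothetical, and the melting temperature estimate uses a common length and GC-count approximation that is assumed here only as a rough guide and is not a substitute for an empirically determined annealing temperature.

```python
def gc_content(primer: str) -> float:
    """Return the fraction of G and C bases in the primer (0.0 to 1.0)."""
    seq = primer.upper()
    if not seq:
        raise ValueError("primer must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def estimate_tm(primer: str) -> float:
    """Rough melting temperature estimate in degrees C.

    Assumption: Tm = 64.9 + 41 * (G + C - 16.4) / N, a basic approximation
    for oligonucleotides longer than about 13 nucleotides. It ignores salt
    and primer concentration.
    """
    seq = primer.upper()
    if not seq:
        raise ValueError("primer must not be empty")
    gc = seq.count("G") + seq.count("C")
    return 64.9 + 41.0 * (gc - 16.4) / len(seq)


def check_primer(primer: str, min_len: int = 22, max_len: int = 30,
                 min_gc: float = 0.50) -> dict:
    """Compare a candidate primer with the length and GC guidelines in the text."""
    return {
        "length_ok": min_len <= len(primer) <= max_len,
        "gc_ok": gc_content(primer) >= min_gc,
        "tm_estimate_c": round(estimate_tm(primer), 1),
    }


if __name__ == "__main__":
    # Hypothetical 24-nucleotide candidate, used only for illustration.
    print(check_primer("GCGCGCATATGCGCGCATATGCGC"))
```

A candidate that passes the length and GC checks would still need its annealing behavior confirmed experimentally or with dedicated primer analysis software.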

When screening for full length cDNAs, it is preferable to use libraries that have been
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T)
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of
sequence into 5' non-transcribed regulatory regions.

Capillary electrophoresis systems which are commercially available may be used to analyze
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary
sequencing may employ flowable polymers for electrophoretic separation, four different
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for
detection of the emitted wavelengths. Output/light intensity may be converted to electrical signal
using appropriate software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and
the entire process from loading of samples to computer analysis and electronic data display may be
computer controlled. Capillary electrophoresis is especially preferable for sequencing small DNA
fragments which may be present in limited amounts in a particular sample.

In another embodiment of the invention, polynucleotide sequences or fragments thereof
which encode TRICH may be cloned in recombinant DNA molecules that direct expression of
TRICH, or fragments or functional equivalents thereof, in appropriate host cells. Due to the
inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same
or a functionally equivalent amino acid sequence may be produced and used to express TRICH.
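
The degeneracy described above can be made concrete with a minimal Python sketch that enumerates every DNA sequence encoding a short peptide. The codon table is a deliberately partial excerpt of the standard genetic code covering only the residues in the example, and the peptide and function name are hypothetical.

```python
from itertools import product

# Partial standard codon table (DNA, sense strand). Only the amino acids
# used in the illustration are listed.
SYNONYMOUS_CODONS = {
    "M": ["ATG"],
    "K": ["AAA", "AAG"],
    "F": ["TTT", "TTC"],
    "W": ["TGG"],
    "L": ["TTA", "TTG", "CTT", "CTC", "CTA", "CTG"],
}


def encodings(peptide: str) -> list:
    """List all DNA sequences that encode the given peptide."""
    pools = [SYNONYMOUS_CODONS[residue] for residue in peptide]
    return ["".join(codons) for codons in product(*pools)]


if __name__ == "__main__":
    # Hypothetical tripeptide Met-Lys-Phe: 1 x 2 x 2 = 4 coding sequences.
    for dna in encodings("MKF"):
        print(dna)
```

The number of alternative coding sequences grows multiplicatively with peptide length, which is why many distinct polynucleotides can express the same polypeptide.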

The nucleotide sequences of the present invention can be engineered using methods
generally known in the art in order to alter TRICH-encoding sequences for a variety of purposes
including, but not limited to, modification of the cloning, processing, and/or expression of the gene
product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments and
synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example,
oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create
new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants,
and so forth.

The nucleotides of the present invention may be subjected to DNA shuffling techniques
such as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No.
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999)
Nat. Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or
improve the biological properties of TRICH, such as its biological or enzymatic activity or its
ability to bind to other molecules or compounds. DNA shuffling is a process by which a library of
gene variants is produced using PCR-mediated recombination of gene fragments. The library is
then subjected to selection or screening procedures that identify those gene variants with the desired
properties. These preferred variants may then be pooled and further subjected to recursive rounds
of DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial"
breeding and rapid molecular evolution. For example, fragments of a single gene containing
random point mutations may be recombined, screened, and then reshuffled until the desired
properties are optimized. Alternatively, fragments of a given gene may be recombined with
fragments of homologous genes in the same gene family, either from the same or different species,
thereby maximizing the genetic diversity of multiple naturally occurring genes in a directed and
controllable manner.
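
The recursive cycle of recombination and selection described above can be sketched as a toy in silico model. This Python sketch is not the MOLECULARBREEDING process or any laboratory protocol: it assumes pre-aligned parent sequences of equal length, fixed fragment boundaries, and a hypothetical scoring function standing in for a screen.

```python
import random


def shuffle_once(parents, fragment_len, rng):
    """Build one chimera by taking each fragment from a randomly chosen parent."""
    length = len(parents[0])
    if any(len(p) != length for p in parents):
        raise ValueError("toy model assumes aligned parents of equal length")
    pieces = []
    for start in range(0, length, fragment_len):
        donor = rng.choice(parents)
        pieces.append(donor[start:start + fragment_len])
    return "".join(pieces)


def evolve(parents, score, rounds, library_size, keep, fragment_len, seed=0):
    """Run recursive rounds of shuffling followed by selection of top variants."""
    rng = random.Random(seed)
    pool = list(parents)
    for _ in range(rounds):
        library = [shuffle_once(pool, fragment_len, rng)
                   for _ in range(library_size)]
        # Pool the new library with the previous variants, then keep the
        # highest-scoring sequences for the next round.
        candidates = sorted(set(library) | set(pool))
        pool = sorted(candidates, key=score, reverse=True)[:keep]
    return pool


if __name__ == "__main__":
    # Hypothetical screen: count of G residues stands in for a desired property.
    def g_count(seq):
        return seq.count("G")

    start = ["AAAAAAAA", "GGGGAAAA", "AAAAGGGG"]
    print(evolve(start, g_count, rounds=3, library_size=20, keep=4, fragment_len=4))
```

Because the previous pool is carried into each selection step, the best score in this model never decreases from one round to the next.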

In another embodiment, sequences encoding TRICH may be synthesized, in whole or in
part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980)
Nucleic Acids Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser.
7:225-232.) Alternatively, TRICH itself or a fragment thereof may be synthesized using chemical
methods. For example, peptide synthesis can be performed using various solution-phase or
solid-phase techniques. (See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular
Properties, WH Freeman, New York NY, pp. 55-60; and Roberge, J.Y. et al. (1995) Science
269:202-204.) Automated synthesis may be achieved using the ABI 431A peptide synthesizer
(Applied Biosystems). Additionally, the amino acid sequence of TRICH, or any part thereof, may
be altered during direct synthesis and/or combined with sequences from other proteins, or any part
thereof, to produce a variant polypeptide or a polypeptide having a sequence of a naturally occurring
polypeptide.

The peptide may be substantially purified by preparative high performance liquid
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.)
The composition of the synthetic peptides may be confirmed by amino acid analysis or by
sequencing. (See, e.g., Creighton, supra, pp. 28-53.)

In order to express a biologically active TRICH, the nucleotide sequences encoding TRICH
or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which
contains the necessary elements for transcriptional and translational control of the inserted coding
sequence in a suitable host. These elements include regulatory sequences, such as enhancers,
constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in
polynucleotide sequences encoding TRICH. Such elements may vary in their strength and
specificity. Specific initiation signals may also be used to achieve more efficient translation of
sequences encoding TRICH. Such signals include the ATG initiation codon and adjacent
sequences, e.g., the Kozak sequence. In cases where sequences encoding TRICH and its initiation
codon and upstream regulatory sequences are inserted into the appropriate expression vector, no
additional transcriptional or translational control signals may be needed. However, in cases where
only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals
including an in-frame ATG initiation codon should be provided by the vector. Exogenous
translational elements and initiation codons may be of various origins, both natural and synthetic.
The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the
particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ.

Methods which are well known to those skilled in the art may be used to construct
expression vectors containing sequences encoding TRICH and appropriate transcriptional and
translational control elements. These methods include in vitro recombinant DNA techniques,
synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989)
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and
16-17; Ausubel, F.M. et al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons,
New York NY, ch. 9, 13, and 16.)

A variety of expression vector/host systems may be utilized to contain and express
sequences encoding TRICH. These include, but are not limited to, microorganisms such as bacteria
transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast
transformed with yeast expression vectors; insect cell systems infected with viral expression vectors
(e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., cauliflower
mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti
or pBR322 plasmids); or animal cell systems. (See, e.g., Sambrook, supra; Ausubel, supra; Van
Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. (1994)
Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945;
Takamatsu, N. (1987) EMBO J. 6:307-311; The McGraw Hill Yearbook of Science and Technology
(1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad.
Sci. USA 81:3655-3659; and Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.) Expression
vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various
bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ, tissue, or
cell population. (See, e.g., Di Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et
al. (1993) Proc. Natl. Acad. Sci. USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature
317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 31(3):219-226; and Verma, I.M.
and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host cell employed.

In bacterial systems, a number of cloning and expression vectors may be selected depending
upon the use intended for polynucleotide sequences encoding TRICH. For example, routine
cloning, subcloning, and propagation of polynucleotide sequences encoding TRICH can be achieved
using a multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or
PSPORT1 plasmid (Life Technologies). Ligation of sequences encoding TRICH into the vector's
multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for
identification of transformed bacteria containing recombinant molecules. In addition, these vectors
may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with helper phage,
and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M.
Schuster (1989) J. Biol. Chem. 264:5503-5509.) When large quantities of TRICH are needed, e.g.,
for the production of antibodies, vectors which direct high level expression of TRICH may be used.
For example, vectors containing the strong, inducible SP6 or T7 bacteriophage promoter may be
used.

Yeast expression systems may be used for production of TRICH. A number of vectors
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such
vectors direct either the secretion or intracellular retention of expressed proteins and enable
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel,
1995, supra; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al.
(1994) Bio/Technology 12:181-184.)

Plant systems may also be used for expression of TRICH. Transcription of sequences
encoding TRICH may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO
J. 6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock
promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et
al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.)
These constructs can be introduced into plant cells by direct DNA transformation or
pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology
(1992) McGraw Hill, New York NY, pp. 191-196.)

In mammalian cells, a number of viral-based expression systems may be utilized. In cases
where an adenovirus is used as an expression vector, sequences encoding TRICH may be ligated
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite
leader sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to
obtain infective virus which expresses TRICH in host cells. (See, e.g., Logan, J. and T. Shenk
(1984) Proc. Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the
Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells.
SV40 or EBV-based vectors may also be used for high-level protein expression.

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are
constructed and delivered via conventional delivery methods (liposomes, polycationic amino
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet.
15:345-355.)

For long term production of recombinant proteins in mammalian systems, stable expression
of TRICH in cell lines is preferred. For example, sequences encoding TRICH can be transformed
into cell lines using expression vectors which may contain viral origins of replication and/or
endogenous expression elements and a selectable marker gene on the same or on a separate vector.
Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in
enriched media before being switched to selective media. The purpose of the selectable marker is to
confer resistance to a selective agent, and its presence allows growth and recovery of cells which
successfully express the introduced sequences. Resistant clones of stably transformed cells may be
propagated using tissue culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed cell lines. These
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine
phosphoribosyltransferase genes, for use in tk⁻ and apr⁻ cells, respectively. (See, e.g., Wigler, M. et
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic,
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance
to methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and
pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g.,
Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981)
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD,
which alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan
(1988) Proc. Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green
fluorescent proteins (GFP; Clontech), β glucuronidase and its substrate β-glucuronide, or luciferase
and its substrate luciferin may be used. These markers can be used not only to identify
transformants, but also to quantify the amount of transient or stable protein expression attributable
to a specific vector system. (See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.)

Although the presence/absence of marker gene expression suggests that the gene of interest
is also present, the presence and expression of the gene may need to be confirmed. For example, if
the sequence encoding TRICH is inserted within a marker gene sequence, transformed cells
containing sequences encoding TRICH can be identified by the absence of marker gene function.
Alternatively, a marker gene can be placed in tandem with a sequence encoding TRICH under the
control of a single promoter. Expression of the marker gene in response to induction or selection
usually indicates expression of the tandem gene as well.

In general, host cells that contain the nucleic acid sequence encoding TRICH and that
express TRICH may be identified by a variety of procedures known to those of skill in the art.
These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR
amplification, and protein bioassay or immunoassay techniques which include membrane, solution,
or chip based technologies for the detection and/or quantification of nucleic acid or protein
sequences.

Immunological methods for detecting and measuring the expression of TRICH using either
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing
monoclonal antibodies reactive to two non-interfering epitopes on TRICH is preferred, but a
competitive binding assay may be employed. These and other assays are well known in the art.
(See, e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul
MN, Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology, Greene Pub. Associates
and Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols,
Humana Press, Totowa NJ.)

A wide variety of labels and conjugation techniques are known by those skilled in the art
and may be used in various nucleic acid and amino acid assays. Means for producing labeled
hybridization or PCR probes for detecting sequences related to polynucleotides encoding TRICH
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled
nucleotide. Alternatively, the sequences encoding TRICH, or any fragments thereof, may be cloned
into a vector for the production of an mRNA probe. Such vectors are known in the art, are
commercially available, and may be used to synthesize RNA probes in vitro by addition of an
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures
may be conducted using a variety of commercially available kits, such as those provided by
Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter
molecules or labels which may be used for ease of detection include radionuclides, enzymes,
fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors,
magnetic particles, and the like.

Host cells transformed with nucleotide sequences encoding TRICH may be cultured under
conditions suitable for the expression and recovery of the protein from cell culture. The protein
produced by a transformed cell may be secreted or retained intracellularly depending on the
sequence and/or the vector used. As will be understood by those of skill in the art, expression
vectors containing polynucleotides which encode TRICH may be designed to contain signal
sequences which direct secretion of TRICH through a prokaryotic or eukaryotic cell membrane.

In addition, a host cell strain may be chosen for its ability to modulate expression of the
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation,
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro"
or "pro" form of the protein may also be used to specify protein targeting, folding, and/or activity.
Different host cells which have specific cellular machinery and characteristic mechanisms for
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct
modification and processing of the foreign protein.

In another embodiment of the invention, natural, modified, or recombinant nucleic acid
sequences encoding TRICH may be ligated to a heterologous sequence resulting in translation of a
fusion protein in any of the aforementioned host systems. For example, a chimeric TRICH protein
containing a heterologous moiety that can be recognized by a commercially available antibody may
facilitate the screening of peptide libraries for inhibitors of TRICH activity. Heterologous protein
and peptide moieties may also facilitate purification of fusion proteins using commercially available
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST),
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His,
FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of
their cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin,
and metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable
immunoaffinity purification of fusion proteins using commercially available monoclonal and
polyclonal antibodies that specifically recognize these epitope tags. A fusion protein may also be
engineered to contain a proteolytic cleavage site located between the TRICH encoding sequence and
the heterologous protein sequence, so that TRICH may be cleaved away from the heterologous
moiety following purification. Methods for fusion protein expression and purification are discussed
in Ausubel (1995, supra, ch. 10). A variety of commercially available kits may also be used to
facilitate expression and purification of fusion proteins.

In a further embodiment of the invention, synthesis of radiolabeled TRICH may be achieved
in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These
systems couple transcription and translation of protein-coding sequences operably associated with
the T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid
precursor, for example, ³⁵S-methionine.

TRICH of the present invention or fragments thereof may be used to screen for compounds
that specifically bind to TRICH. At least one and up to a plurality of test compounds may be
screened for specific binding to TRICH. Examples of test compounds include antibodies,
oligonucleotides, proteins (e.g., receptors), or small molecules.

In one embodiment, the compound thus identified is closely related to the natural ligand of
TRICH, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or
a natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) Current Protocols in Immunology
1(2):Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which
TRICH binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case,
the compound can be rationally designed using known techniques. In one embodiment, screening
for these compounds involves producing appropriate cells which express TRICH, either as a
secreted protein or on the cell membrane. Preferred cells include cells from mammals, yeast,
Drosophila, or E. coli. Cells expressing TRICH or cell membrane fractions which contain TRICH
are then contacted with a test compound and binding, stimulation, or inhibition of activity of either
TRICH or the compound is analyzed.

An assay may simply test binding of a test compound to the polypeptide, wherein binding is
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example,
the assay may comprise the steps of combining at least one test compound with TRICH, either in
solution or affixed to a solid support, and detecting the binding of TRICH to the compound.
Alternatively, the assay may detect or measure binding of a test compound in the presence of a
labeled competitor. Additionally, the assay may be carried out using cell-free preparations,
chemical libraries, or natural product mixtures, and the test compound(s) may be free in solution or
affixed to a solid support.

TRICH of the present invention or fragments thereof may be used to screen for compounds
that modulate the activity of TRICH. Such compounds may include agonists, antagonists, or partial
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for
TRICH activity, wherein TRICH is combined with at least one test compound, and the activity of
TRICH in the presence of a test compound is compared with the activity of TRICH in the absence
of the test compound. A change in the activity of TRICH in the presence of the test compound is
indicative of a compound that modulates the activity of TRICH. Alternatively, a test compound is
combined with an in vitro or cell-free system comprising TRICH under conditions suitable for
TRICH activity, and the assay is performed. In either of these assays, a test compound which
modulates the activity of TRICH may do so indirectly and need not come in direct contact with
TRICH. At least one and up to a plurality of test compounds may be screened.
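
The comparison of activity in the presence and absence of a test compound can be expressed as a simple calculation. The following Python sketch is illustrative only: the function name, the activity units, and the 20% decision threshold are assumptions introduced here, and a real assay would rely on replicates and appropriate statistics.

```python
def compare_activity(with_compound: float, without_compound: float,
                     threshold: float = 0.20):
    """Return the fractional change in activity and a coarse label.

    The threshold (assumed 20%) is an arbitrary cutoff for calling a
    change; it is not taken from the text.
    """
    if without_compound <= 0:
        raise ValueError("baseline activity must be positive")
    change = (with_compound - without_compound) / without_compound
    if change >= threshold:
        label = "increase"
    elif change <= -threshold:
        label = "decrease"
    else:
        label = "no clear change"
    return change, label


if __name__ == "__main__":
    # Hypothetical readings in arbitrary units.
    for reading in (150.0, 40.0, 105.0):
        print(reading, compare_activity(reading, 100.0))
```

A change in either direction flags the compound as a candidate modulator; distinguishing agonists, antagonists, and partial or inverse agonists would require further characterization.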

In another embodiment, polynucleotides encoding TRICH or their mammalian homologs
may be "knocked out" in an animal model system using homologous recombination in embryonic
stem (ES) cells. Such techniques are well known in the art and are useful for the generation of
animal models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No.
5,767,337.) For example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the
early mouse embryo and grown in culture. The ES cells are transformed with a vector containing
the gene of interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo;
Capecchi, M.R. (1989) Science 244:1288-1292). The vector integrates into the corresponding
region of the host genome by homologous recombination. Alternatively, homologous
recombination takes place using the Cre-loxP system to knockout a gene of interest in a tissue- or
developmental stage-specific manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U.
et al. (1997) Nucleic Acids Res. 25:4323-4330). Transformed ES cells are identified and
microinjected into mouse cell blastocysts such as those from the C57BL/6 mouse strain. The
blastocysts are surgically transferred to pseudopregnant dams, and the resulting chimeric progeny
are genotyped and bred to produce heterozygous or homozygous strains. Transgenic animals thus
generated may be tested with potential therapeutic or toxic agents.

Polynucleotides encoding TRICH may also be manipulated in vitro in ES cells derived from
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate
cell lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages
differentiate into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson,
J.A. et al. (1998) Science 282:1145-1147).

Polynucleotides encoding TRICH can also be used to create "knockin" humanized animals
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a
region of a polynucleotide encoding TRICH is injected into animal ES cells, and the injected
sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and
the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and
treated with potential pharmaceutical agents to obtain information on treatment of a human disease.
Alternatively, a mammal inbred to overexpress TRICH, e.g., by secreting TRICH in its milk, may
also serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev.
4:55-74).

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists
between regions of TRICH and transporters and ion channels. In addition, examples of tissues
expressing TRICH can be found in Table 6. Therefore, TRICH appears to play a role in transport,
neurological, muscle, immunological and cell proliferative disorders. In the treatment of disorders
associated with increased TRICH expression or activity, it is desirable to decrease the expression or
activity of TRICH. In the treatment of disorders associated with decreased TRICH expression or
activity, it is desirable to increase the expression or activity of TRICH.

Therefore, in one embodiment, TRICH or a fragment or derivative thereof may be
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's
muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes
insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis,
normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug
resistance, myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias,
peripheral neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with
transport, e.g., angina, bradyarrhythmia, tachyarrhythmia, hypertension, Long QT syndrome,
myocarditis, cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy,
mitochondrial myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body
myositis, infectious myositis, polymyositis, neurological disorders associated with transport, e.g.,
Alzheimer's disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder,
paranoid psychoses, and schizophrenia, and other disorders associated with transport, e.g.,
neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia,
Wilson's disease, cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness,
hyperglycemia, hypoglycemia, Graves' disease, goiter, Cushing's disease, Addison's disease,
glucose-galactose malabsorption syndrome, glycogen storage disease, hypercholesterolemia,
adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn syndrome, von Gierke
disease, pseudohypoaldosteronism type 1, Liddle's syndrome, cystinuria, iminoglycinuria, Hartup
disease, Fanconi disease, and Bartter syndrome; a neurological disorder such as epilepsy, ischemic
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease,
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders,
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess,
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system
disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and
Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic
diseases of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal
hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation and other
developmental disorders of the central nervous system including Down syndrome, cerebral palsy,
neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system
disorders, dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic
myopathies, myasthenia gravis, periodic paralysis, mental disorders including mood, anxiety, and
schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic
neuropathy, hemiplegic migraine, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic
neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and
familial frontotemporal dementia; a muscle disorder such as cardiomyopathy, myocarditis,
Duchenne's muscular dystrophy, Becker's muscular
35 dystrophy, myotonic dystrophy, central core disease, nemaline myopatiiy, centronuclear myopathy. 
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lipid myopathy, mitochondrial myopathy, infectious myositis^ polymyositis, dennatomyositis, 
inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, anaphylactic shock,
arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension, hypoglycemia,
myocardial infarction, migraine, pheochromocytoma, and myopathies including encephalopathy,
epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, ophthalmoplegia, acid
maltase deficiency (AMD, also known as Pompe's disease), generalized myotonia, and myotonia
congenita; an immunological disorder such as acquired immunodeficiency syndrome (AIDS),
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis,
amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune
thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED),
bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis,
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis
fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout,
Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple
sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis,
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma,
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis,
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer,
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and
helminthic infections, and trauma; and a cell proliferative disorder such as actinic keratosis,
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease
(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis,
primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma,
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone,
bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver,
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis,
thymus, thyroid, and uterus.

In another embodiment, a vector capable of expressing TRICH or a fragment or derivative
thereof may be administered to a subject to treat or prevent a disorder associated with decreased
expression or activity of TRICH including, but not limited to, those described above.

In a further embodiment, a composition comprising a substantially purified TRICH in
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or
prevent a disorder associated with decreased expression or activity of TRICH including, but not
limited to, those provided above.

In still another embodiment, an agonist which modulates the activity of TRICH may be
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of TRICH including, but not limited to, those listed above.

In a further embodiment, an antagonist of TRICH may be administered to a subject to treat
or prevent a disorder associated with increased expression or activity of TRICH. Examples of such
disorders include, but are not limited to, those transport, neurological, muscle, immunological and
cell proliferative disorders described above. In one aspect, an antibody which specifically binds
TRICH may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for
bringing a pharmaceutical agent to cells or tissues which express TRICH.

In an additional embodiment, a vector expressing the complement of the polynucleotide
encoding TRICH may be administered to a subject to treat or prevent a disorder associated with
increased expression or activity of TRICH including, but not limited to, those described above.

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary
sequences, or vectors of the invention may be administered in combination with other appropriate
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made
by one of ordinary skill in the art, according to conventional pharmaceutical principles. The
combination of therapeutic agents may act synergistically to effect the treatment or prevention of the
various disorders described above. Using this approach, one may be able to achieve therapeutic
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects.

An antagonist of TRICH may be produced using methods which are generally known in the
art. In particular, purified TRICH may be used to produce antibodies or to screen libraries of
pharmaceutical agents to identify those which specifically bind TRICH. Antibodies to TRICH may
also be generated using methods that are well known in the art. Such antibodies may include, but
are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments,
and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which
inhibit dimer formation) are generally preferred for therapeutic use.

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans,
and others may be immunized by injection with TRICH or with any fragment or oligopeptide
thereof which has immunogenic properties. Depending on the host species, various adjuvants may
be used to increase immunological response. Such adjuvants include, but are not limited to,
Freund's, mineral gels such as aluminum hydroxide, and surface active substances such as
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among
adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are
especially preferable.

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to
TRICH have an amino acid sequence consisting of at least about 5 amino acids, and generally will
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or
fragments are identical to a portion of the amino acid sequence of the natural protein. Short
stretches of TRICH amino acids may be fused with those of another protein, such as KLH, and
antibodies to the chimeric molecule may be produced.

Monoclonal antibodies to TRICH may be prepared using any technique which provides for
the production of antibody molecules by continuous cell lines in culture. These include, but are not
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the
EBV-hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al.
(1985) J. Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA
80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.)

In addition, techniques developed for the production of "chimeric antibodies," such as the
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc.
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and
Takeda, S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the
production of single chain antibodies may be adapted, using methods known in the art, to produce
TRICH-specific single chain antibodies. Antibodies with related specificity, but of distinct
idiotypic composition, may be generated by chain shuffling from random combinatorial
immunoglobulin libraries. (See, e.g., Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA
88:10134-10137.)

Antibodies may also be produced by inducing in vivo production in the lymphocyte
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as
disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.)

Antibody fragments which contain specific binding sites for TRICH may also be generated.
For example, such fragments include, but are not limited to, F(ab')2 fragments produced by pepsin
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D.
et al. (1989) Science 246:1275-1281.)

Various immunoassays may be used for screening to identify antibodies having the desired
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such
immunoassays typically involve the measurement of complex formation between TRICH and its
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies
reactive to two non-interfering TRICH epitopes is generally used, but a competitive binding assay
may also be employed (Pound, supra).

Various methods such as Scatchard analysis in conjunction with radioimmunoassay
techniques may be used to assess the affinity of antibodies for TRICH. Affinity is expressed as an
association constant, Ka, which is defined as the molar concentration of TRICH-antibody complex
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions.
The Ka determined for a preparation of polyclonal antibodies, which are heterogeneous in their
affinities for multiple TRICH epitopes, represents the average affinity, or avidity, of the antibodies
for TRICH. The Ka determined for a preparation of monoclonal antibodies, which are monospecific
for a particular TRICH epitope, represents a true measure of affinity. High-affinity antibody
preparations with Ka ranging from about 10^9 to 10^12 L/mole are preferred for use in immunoassays
in which the TRICH-antibody complex must withstand rigorous manipulations. Low-affinity
antibody preparations with Ka ranging from about 10^6 to 10^7 L/mole are preferred for use in
immunopurification and similar procedures which ultimately require dissociation of TRICH,
preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical
Approach, IRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A Practical Guide to
Monoclonal Antibodies, John Wiley & Sons, New York NY).
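
By way of illustration only, the definition of the association constant and a Scatchard-type
estimate can be sketched in Python. The sketch assumes a single class of binding sites, for which
a plot of bound/free against bound is a straight line with slope equal to the negative of Ka. The
function names, the plain least-squares fit, and the simulated concentrations are assumptions made
for the example and are not taken from the cited references.

```python
def association_constant(complex_m, free_antigen_m, free_antibody_m):
    """Ka (L/mole) from equilibrium molar concentrations of complex and free species."""
    return complex_m / (free_antigen_m * free_antibody_m)


def scatchard_fit(bound, free):
    """Fit a line to (bound, bound/free) and return (Ka, Bmax).

    Assumes one class of sites, so that bound/free = Ka * (Bmax - bound).
    """
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ka = -slope              # slope of the Scatchard line is -Ka
    bmax = intercept / ka    # x-intercept is the total site concentration
    return ka, bmax


def simulate_binding(ka, bmax, free):
    """Hypothetical ideal binding data for a single class of sites."""
    return [bmax * ka * f / (1.0 + ka * f) for f in free]


if __name__ == "__main__":
    free = [0.2e-9, 0.5e-9, 1e-9, 2e-9, 5e-9]       # free antigen, mol/L (made up)
    bound = simulate_binding(1e9, 2e-9, free)       # bound antigen, mol/L
    print(scatchard_fit(bound, free))
```

With a polyclonal preparation the Scatchard plot is generally curved, so a single fitted slope of
this kind would correspond to an average value rather than a true affinity.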

The titer and avidity of polyclonal antibody preparations may be further evaluated to
determine the quality and suitability of such preparations for certain downstream applications. For
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml,
preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring
precipitation of TRICH-antibody complexes. Procedures for evaluating antibody specificity, titer,
and avidity, and guidelines for antibody quality and usage in various applications, are generally
available. (See, e.g., Catty, supra, and Coligan et al., supra.)

In another embodiment of the invention, the polynucleotides encoding TRICH, or any
fragment or complement thereof, may be used for therapeutic purposes. In one aspect,
modifications of gene expression can be achieved by designing complementary sequences or
antisense molecules (DNA, RNA, PNA, or modified oligonucleotides) to the coding or regulatory
regions of the gene encoding TRICH. Such technology is well known in the art, and antisense
oligonucleotides or larger fragments can be designed from various locations along the coding or
control regions of sequences encoding TRICH. (See, e.g., Agrawal, S., ed. (1996) Antisense
Therapeutics, Humana Press Inc., Totowa NJ.)
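
As a non-limiting illustration, the design of complementary sequences from various locations
along a target can be sketched in Python. The sketch simply takes the reverse complement of
successive windows of a sense-strand sequence; the function names, the 20-nucleotide window, the
10-nucleotide step, and the example sequence are hypothetical choices made for the example and are
not parameters taught above.

```python
# Complement table for a sense strand given as DNA (T) or mRNA (U); output is DNA.
_COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def antisense_dna(target, start, length):
    """Return the antisense DNA oligo (5'->3') for target[start:start+length]."""
    region = target.upper()[start:start + length]
    if len(region) != length:
        raise ValueError("window extends past the end of the target")
    if set(region) - set("ACGTU"):
        raise ValueError("target contains characters other than A, C, G, T, U")
    # Complement each base, then reverse so the oligo reads 5'->3'.
    return region.translate(_COMPLEMENT)[::-1]


def tile_antisense(target, length=20, step=10):
    """Candidate antisense oligos from successive locations along the target."""
    return [
        (start, antisense_dna(target, start, length))
        for start in range(0, len(target) - length + 1, step)
    ]


if __name__ == "__main__":
    sense = "ATGGCCAAGCTGACCTTCGAGGGTAACCGTCTGGAAATCC"  # made-up 40-nt sequence
    for start, oligo in tile_antisense(sense):
        print(start, oligo)
```

Candidates produced this way would still need to be tested for activity; the sketch addresses
only the base-pairing relationship between the oligonucleotide and its target.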

In therapeutic use, any gene delivery system suitable for introduction of the antisense
sequences into appropriate target cells can be used. Antisense sequences can be delivered
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g.,
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995)
9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of
viral vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990)
Blood 76:271; Ausubel, supra; Uckert, W. and W. Walther (1994) Pharmacol. Ther.
63(3):323-347.) Other gene delivery mechanisms include liposome-derived systems, artificial viral
envelopes, and other systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull.
51(1):217-225; Boado, R.J. et al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M.C. et al.
(1997) Nucleic Acids Res. 25(14):2730-2736.)

In another embodiment of the invention, polynucleotides encoding TRICH may be used for
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic
deficiency (e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease
characterized by X-linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672),
severe combined immunodeficiency syndrome associated with an inherited adenosine deaminase
(ADA) deficiency (Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995)
Science 270:470-475), cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al.
(1995) Hum. Gene Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703),
thalassemias, familial hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX
deficiencies (Crystal, R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature
389:239-242)), (ii) express a conditionally lethal gene product (e.g., in the case of cancers which
result from unregulated cell proliferation), or (iii) express a protein which affords protection against
intracellular parasites (e.g., against human retroviruses, such as human immunodeficiency virus
(HIV) (Baltimore, D. (1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci.
USA 93:11395-11399), hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida
albicans and Paracoccidioides brasiliensis; and protozoan parasites such as Plasmodium falciparum
and Trypanosoma cruzi). In the case where a genetic deficiency in TRICH expression or regulation
causes disease, the expression of TRICH from an appropriate population of transduced cells may
alleviate the clinical manifestations caused by the genetic deficiency.

In a further embodiment of the invention, diseases or disorders caused by deficiencies in
TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing
these vectors by mechanical means into TRICH-deficient cells. Mechanical transfer technologies
for use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii)
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev.
Biochem. 62:191-217; Ivics, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr.
Opin. Biotechnol. 9:445-450).

Expression vectors that may be effective for the expression of TRICH include, but are not
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors
(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla
CA), and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC.
TRICH may be expressed using (i) a constitutively active promoter (e.g., from cytomegalovirus
(CMV), Rous sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or β-actin genes), (ii) an
inducible promoter (e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992)
Proc. Natl. Acad. Sci. USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi,
F.M.V. and H.M. Blau (1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the
T-REX plasmid (Invitrogen)); the ecdysone-inducible promoter (available in the plasmids PVGRXR
and PIND; Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone
inducible promoter (Rossi, F.M.V. and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the
native promoter of the endogenous gene encoding TRICH from a normal individual.

Commercially available liposome transformation kits (e.g., the PERFECT LIPID
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver
polynucleotides to target cells in culture and require minimal effort to optimize experimental
parameters. In the alternative, transformation is performed using the calcium phosphate method
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al.
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of
these standardized mammalian transfection protocols.

In another embodiment of the invention, diseases or disorders caused by genetic defects
with respect to TRICH expression are treated by constructing a retrovirus vector consisting of (i) the
polynucleotide encoding TRICH under the control of an independent promoter or the retrovirus long
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive
element (RRE) along with additional retrovirus cis-acting RNA sequences and coding sequences
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc.
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al.
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey,
R. et al. (1998) J. Virol. 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ("Method for obtaining
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant")
discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by
reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4+
T-cells), and the return of transduced cells to a patient are procedures well known to persons skilled
in the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol.
71:7020-7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol.
71:4707-4716; Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997)
Blood 89:2283-2290).

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver
polynucleotides encoding TRICH to cells which have one or more genetic abnormalities with
respect to the expression of TRICH. The construction and packaging of adenovirus-based vectors
are well known to those with ordinary skill in the art. Replication defective adenovirus vectors have
proven to be versatile for importing genes encoding immunoregulatory proteins into intact islets in
the pancreas (Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral
vectors are described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene
therapy"), hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al.
(1999) Annu. Rev. Nutr. 19:511-544 and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242,
both incorporated by reference herein.

In another alternative, a herpes-based, gene therapy delivery system is used to deliver
polynucleotides encoding TRICH to target cells which have one or more genetic abnormalities with
respect to the expression of TRICH. The use of herpes simplex virus (HSV)-based vectors may be
especially valuable for introducing TRICH to cells of the central nervous system, for which HSV
has a tropism. The construction and packaging of herpes-based vectors are well known to those
with ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1-based
vector has been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp.
Eye Res. 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail
in U.S. Patent No. 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is
hereby incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV
d92 which consists of a genome containing at least one exogenous gene to be transferred to a cell
under the control of the appropriate promoter for purposes including human gene therapy. Also
taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4,
ICP27 and ICP22. For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu,
H. et al. (1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of
cloned herpesvirus sequences, the generation of recombinant virus following the transfection of
multiple plasmids containing different segments of the large herpesvirus genomes, the growth and
propagation of herpesvirus, and the infection of cells with herpesvirus are techniques well known to
those of ordinary skill in the art.

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to
deliver polynucleotides encoding TRICH to target cells. The biology of the prototypic alphavirus,
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA,
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic
activity (e.g., protease and polymerase). Similarly, inserting the coding sequence for TRICH into
the alphavirus genome in place of the capsid-coding region results in the production of a large
number of TRICH-coding RNAs and the synthesis of high levels of TRICH in vector transduced
cells. While alphavirus infection is typically associated with cell lysis within a few days, the ability
to establish a persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis
virus (SIN) indicates that the lytic replication of alphaviruses can be altered to suit the needs of the
gene therapy application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of
alphaviruses will allow the introduction of TRICH into a variety of cell types. The specific
transduction of a subset of cells in a population may require the sorting of cells prior to
transduction. The methods of manipulating infectious cDNA clones of alphaviruses, performing
alphavirus cDNA and RNA transfections, and performing alphavirus infections, are well known to
those with ordinary skill in the art.

Oligonucleotides derived from the transcription initiation site, e.g., between about positions
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly,
inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is
useful because it causes inhibition of the ability of the double helix to open sufficiently for the
binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances
using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber,
B.E. and B.I. Carr, Molecular and Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp.
163-177.) A complementary sequence or antisense molecule may also be designed to block
translation of mRNA by preventing the transcript from binding to ribosomes.

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage
of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the
ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. For
example, engineered hammerhead motif ribozyme molecules may specifically and efficiently
catalyze endonucleolytic cleavage of sequences encoding TRICH.

Specific ribozyme cleavage sites within any potential RNA target are initially identified by
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA,
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides,
corresponding to the region of the target gene containing the cleavage site, may be evaluated for
secondary structural features which may render the oligonucleotide inoperable. The suitability of
candidate targets may also be evaluated by testing accessibility to hybridization with
complementary oligonucleotides using ribonuclease protection assays.
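
The initial scanning step described above can be sketched, for illustration only, in Python. The
sketch locates GUA, GUU, and GUC triplets in a target and returns, for each, a stretch of 15 to
20 ribonucleotides containing the site. The function names, the default window of 17
ribonucleotides, and the centering of the window on the triplet are assumptions made for the
example; evaluation of secondary structure and of accessibility is not attempted here.

```python
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def _as_rna(sequence):
    """Upper-case the sequence and read any T as U."""
    return sequence.upper().replace("T", "U")


def find_cleavage_sites(target):
    """Return (position, triplet) for every GUA, GUU, or GUC in the target."""
    rna = _as_rna(target)
    return [
        (i, rna[i:i + 3])
        for i in range(len(rna) - 2)
        if rna[i:i + 3] in CLEAVAGE_TRIPLETS
    ]


def candidate_windows(target, window=17):
    """Return (position, triplet, subsequence) with each site inside a 15-20 nt window."""
    if not 15 <= window <= 20:
        raise ValueError("window must be between 15 and 20 ribonucleotides")
    rna = _as_rna(target)
    if len(rna) < window:
        return []
    candidates = []
    for pos, triplet in find_cleavage_sites(rna):
        # Center the window on the middle base of the triplet, clamped to the target ends.
        start = pos + 1 - window // 2
        start = max(0, min(start, len(rna) - window))
        candidates.append((pos, triplet, rna[start:start + window]))
    return candidates


if __name__ == "__main__":
    for hit in candidate_windows("AAACCCGGGUCAAACCCGGGAAA"):  # made-up target
        print(hit)
```

Each returned subsequence would then be a candidate for the structural and accessibility
evaluations described above.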

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared
by any method known in the art for the synthesis of nucleic acid molecules. These include
techniques for chemically synthesizing oligonucleotides such as solid phase phosphoramidite
chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in vivo
transcription of DNA sequences encoding TRICH. Such DNA sequences may be incorporated into
a wide variety of vectors with suitable RNA polymerase promoters such as T7 or SP6.
Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or
inducibly, can be introduced into cell lines, cells, or tissues.

RNA molecules may be modified to increase intracellular stability and half-life. Possible
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3'
ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs
and can be extended in all of these molecules by the inclusion of nontraditional bases such as
inosine, queuosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms
of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by
endogenous endonucleases.

An additional embodiment of the invention encompasses a method for screening for a
compound which is effective in altering expression of a polynucleotide encoding TRICH.
Compounds which may be effective in altering expression of a specific polynucleotide may include,
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and
non-macromolecular chemical entities which are capable of interacting with specific polynucleotide
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors
or promoters of polynucleotide expression. Thus, in the treatment of disorders associated with
increased TRICH expression or activity, a compound which specifically inhibits expression of the
polynucleotide encoding TRICH may be therapeutically useful, and in the treatment of disorders
associated with decreased TRICH expression or activity, a compound which specifically promotes
expression of the polynucleotide encoding TRICH may be therapeutically useful.

At least one, and up to a plurality, of test compounds may be screened for effectiveness in
altering expression of a specific polynucleotide. A test compound may be obtained by any method
commonly known in the art, including chemical modification of a compound known to be effective
in altering polynucleotide expression; selection from an existing, commercially-available or
proprietary library of naturally-occurring or non-natural chemical compounds; rational design of a
compound based on chemical and/or structural properties of the target polynucleotide; and selection
from a library of chemical compounds created combinatorially or randomly. A sample comprising
a polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. The
sample may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or
reconstituted biochemical system. Alterations in the expression of a polynucleotide encoding
TRICH are assayed by any method commonly known in the art. Typically, the expression of a
specific nucleotide is detected by hybridization with a probe having a nucleotide sequence
complementary to the sequence of the polynucleotide encoding TRICH. The amount of
hybridization may be quantified, thus forming the basis for a comparison of the expression of the
polynucleotide both with and without exposure to one or more test compounds. Detection of a
change in the expression of a polynucleotide exposed to a test compound indicates that the test
compound is effective in altering the expression of the polynucleotide. A screen for a compound
effective in altering expression of a specific polynucleotide can be carried out, for example, using a
Schizosaccharomyces pombe gene expression system (Atkins, D. et al. (1999) U.S. Patent No.
5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids Res. 28:E15) or a human cell line such as HeLa
cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. Commun. 268:8-13). A particular
embodiment of the present invention involves screening a combinatorial library of oligonucleotides
(such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and modified
oligonucleotides) for antisense activity against a specific polynucleotide sequence (Bruice, T.W. et
al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 6,022,691).
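
By way of illustration only, the comparison of quantified hybridization with and without a test
compound might be sketched as follows. The function names, the example readings, and the two-fold
cutoff are assumptions made for this sketch; the text does not prescribe a particular threshold
or statistic.

```python
from statistics import mean

def expression_fold_change(untreated, treated):
    """Ratio of mean hybridization signal with vs. without the test compound."""
    baseline = mean(untreated)
    if baseline <= 0:
        raise ValueError("untreated signal must be positive")
    return mean(treated) / baseline

def alters_expression(untreated, treated, min_fold=2.0):
    """Flag a compound whose signal changes by at least min_fold, up or down.

    min_fold is an assumed cutoff chosen for illustration, not a value from the text.
    """
    fold = expression_fold_change(untreated, treated)
    return fold >= min_fold or fold <= 1.0 / min_fold

# Hypothetical replicate readings (arbitrary signal units).
untreated_signal = [100, 110, 90]
treated_signal = [300, 310, 290]
print(expression_fold_change(untreated_signal, treated_signal))
print(alters_expression(untreated_signal, treated_signal))
```

In practice a replicate-aware statistical test would likely replace the fixed cutoff.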

Many methods for introducing vectors into cells or tissues are available and equally suitable
for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells
taken from the patient and clonally propagated for autologous transplant back into that same patient.
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be
achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997)
Nat. Biotechnol. 15:462-466.)

Any of the therapeutic methods described above may be applied to any subject in need of
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits,
and monkeys.

An additional embodiment of the invention relates to the administration of a composition
which generally comprises an active ingredient formulated with a pharmaceutically acceptable
excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and proteins.
Various formulations are commonly known and are thoroughly discussed in the latest edition of
Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may
consist of TRICH, antibodies to TRICH, and mimetics, agonists, antagonists, or inhibitors of
TRICH.

The compositions utilized in this invention may be administered by any number of routes
including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary,
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal,
enteral, topical, sublingual, or rectal means.

Compositions for pulmonary administration may be prepared in liquid or dry powder form.
These compositions are generally aerosolized immediately prior to inhalation by the patient. In the
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of
fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger
peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar
region of the lung have enabled the practical delivery of drugs such as insulin to blood circulation
(see, e.g., Patton, J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of
administration without needle injection, and obviates the need for potentially toxic penetration
enhancers.

Compositions suitable for use in the invention include compositions wherein the active
ingredients are contained in an effective amount to achieve the intended purpose. The
determination of an effective dose is well within the capability of those skilled in the art.

Specialized forms of compositions may be prepared for direct intracellular delivery of
macromolecules comprising TRICH or fragments thereof. For example, liposome preparations
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of
the macromolecule. Alternatively, TRICH or a fragment thereof may be joined to a short cationic
N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R.
et al. (1999) Science 285:1569-1572).

For any compound, the therapeutically effective dose can be estimated initially either in cell
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs,
monkeys, or pigs. An animal model may also be used to determine the appropriate concentration
range and route of administration. Such information can then be used to determine useful doses and
routes for administration in humans.

A therapeutically effective dose refers to that amount of active ingredient, for example
TRICH or fragments thereof, antibodies of TRICH, and agonists, antagonists or inhibitors of
TRICH, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be
determined by standard pharmaceutical procedures in cell cultures or with experimental animals,
such as by calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50
(the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is
the therapeutic index, which can be expressed as the LD50/ED50 ratio. Compositions which exhibit
large therapeutic indices are preferred. The data obtained from cell culture assays and animal
studies are used to formulate a range of dosage for human use. The dosage contained in such
compositions is preferably within a range of circulating concentrations that includes the ED50 with
little or no toxicity. The dosage varies within this range depending upon the dosage form employed,
the sensitivity of the patient, and the route of administration.
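
The therapeutic index calculation described above can be sketched briefly. This example assumes
that dose-response data are available as (dose, fraction responding) pairs and estimates the 50%
point by linear interpolation on log10 dose; the interpolation method, the function names, and the
data are illustrative assumptions rather than a procedure taken from the text.

```python
import math

def dose_at_half_response(doses, fraction_responding):
    """Estimate the dose at which 50% of subjects respond.

    Interpolates linearly on log10(dose) between the two bracketing data points.
    """
    points = sorted(zip(doses, fraction_responding))
    for (d_lo, f_lo), (d_hi, f_hi) in zip(points, points[1:]):
        if f_lo <= 0.5 <= f_hi and f_hi > f_lo:
            t = (0.5 - f_lo) / (f_hi - f_lo)
            log_dose = math.log10(d_lo) + t * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_dose
    raise ValueError("responses do not bracket 50%")

def therapeutic_index(ld50, ed50):
    """Therapeutic index expressed as the LD50/ED50 ratio."""
    return ld50 / ed50

# Hypothetical data: doses in arbitrary units, fractions of the test population.
ed50 = dose_at_half_response([1, 100], [0.0, 1.0])           # efficacy curve
ld50 = dose_at_half_response([100, 10000], [0.0, 1.0])       # lethality curve
print(ed50, ld50, therapeutic_index(ld50, ed50))
```

A larger ratio corresponds to a wider margin between the effective and the lethal dose.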

The exact dosage will be determined by the practitioner, in light of factors related to the
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of
the active moiety or to maintain the desired effect. Factors which may be taken into account include
the severity of the disease state, the general health of the subject, the age, weight, and gender of the
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and
response to therapy. Long-acting compositions may be administered every 3 to 4 days, every week,
or biweekly depending on the half-life and clearance rate of the particular formulation.

Normal dosage amounts may vary from about 0.1 µg to 100,000 µg, up to a total dose of
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and
methods of delivery is provided in the literature and generally available to practitioners in the art.
Those skilled in the art will employ different formulations for nucleotides than for proteins or their
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells,
conditions, locations, etc.
DIAGNOSTICS

In another embodiment, antibodies which specifically bind TRICH may be used for the
diagnosis of disorders characterized by expression of TRICH, or in assays to monitor patients being
treated with TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for
diagnostic purposes may be prepared in the same manner as described above for therapeutics.
Diagnostic assays for TRICH include methods which utilize the antibody and a label to detect
TRICH in human body fluids or in extracts of cells or tissues. The antibodies may be used with or
without modification, and may be labeled by covalent or non-covalent attachment of a reporter
molecule. A wide variety of reporter molecules, several of which are described above, are known in
the art and may be used.

A variety of protocols for measuring TRICH, including ELISAs, RIAs, and FACS, are
known in the art and provide a basis for diagnosing altered or abnormal levels of TRICH expression.
Normal or standard values for TRICH expression are established by combining body fluids or cell
extracts taken from normal mammalian subjects, for example, human subjects, with antibodies to
TRICH under conditions suitable for complex formation. The amount of standard complex
formation may be quantitated by various methods, such as photometric means. Quantities of
TRICH expressed in subject, control, and disease samples from biopsied tissues are compared with
the standard values. Deviation between standard and subject values establishes the parameters for
diagnosing disease.
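
One simple way to picture the comparison of a subject value with standard values is sketched
below. It assumes the standard is summarized as the mean of readings from normal subjects and
that deviation is judged against a band of k standard deviations; both the band and the default
k = 2 are assumptions made for illustration and are not specified in the text.

```python
from statistics import mean, stdev

def standard_range(normal_values, k=2.0):
    """Return (low, high) bounds: mean of normal readings +/- k sample standard deviations."""
    center = mean(normal_values)
    spread = stdev(normal_values)
    return center - k * spread, center + k * spread

def classify(subject_value, normal_values, k=2.0):
    """Label a subject reading relative to the assumed standard range."""
    low, high = standard_range(normal_values, k)
    if subject_value < low:
        return "decreased"
    if subject_value > high:
        return "elevated"
    return "within standard range"

# Hypothetical photometric readings from normal subjects (arbitrary units).
normal_readings = [10, 12, 11, 9, 13]
print(classify(20, normal_readings))
print(classify(11, normal_readings))
```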

In another embodiment of the invention, the polynucleotides encoding TRICH may be used
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide
sequences, complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used
to detect and quantify gene expression in biopsied tissues in which expression of TRICH may be
correlated with disease. The diagnostic assay may be used to determine absence, presence, and
excess expression of TRICH, and to monitor regulation of TRICH levels during therapeutic
intervention.

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide
sequences, including genomic sequences, encoding TRICH or closely related molecules may be
used to identify nucleic acid sequences which encode TRICH. The specificity of the probe, whether
it is made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region,
e.g., a conserved motif, and the stringency of the hybridization or amplification will determine
whether the probe identifies only naturally occurring sequences encoding TRICH, allelic variants,
or related sequences.

Probes may also be used for the detection of related sequences, and may have at least 50%
sequence identity to any of the TRICH encoding sequences. The hybridization probes of the subject
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:33-64 or
from genomic sequences including promoters, enhancers, and introns of the TRICH gene.
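
For illustration, the 50% sequence identity criterion can be checked for two sequences that have
already been aligned (gaps written as "-"). This sketch assumes that identity is counted over all
aligned columns, with gap columns treated as mismatches; other conventions exist, and neither the
convention nor the example sequences come from the text.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns in which both sequences carry the same residue.

    Columns that are gaps in both sequences are ignored; a gap opposite a residue
    counts as a mismatch (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared

# Hypothetical probe and target fragments.
probe = "ACGTACGT"
target = "ACGTTCGA"
identity = percent_identity(probe, target)
print(identity, identity >= 50.0)
```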

Means for producing specific hybridization probes for DNAs encoding TRICH include the
cloning of polynucleotide sequences encoding TRICH or TRICH derivatives into vectors for the
production of mRNA probes. Such vectors are known in the art, are commercially available, and
may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a
variety of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic labels,
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like.

Polynucleotide sequences encoding TRICH may be used for the diagnosis of disorders
associated with expression of TRICH. Examples of such disorders include, but are not limited to, a
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic
fibrosis, Becker's muscular dystrophy, Bell's palsy, Charcot-Marie-Tooth disease, diabetes mellitus,
diabetes insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic
paralysis, normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug
resistance, myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias,
peripheral neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with
transport, e.g., angina, bradyarrhythmia, tachyarrhythmia, hypertension, Long QT syndrome,
myocarditis, cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy,
mitochondrial myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body
myositis, infectious myositis, polymyositis, neurological disorders associated with transport, e.g.,
Alzheimer's disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder,
paranoid psychoses, and schizophrenia, and other disorders associated with transport, e.g.,
neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia,
Wilson's disease, cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness,
hyperglycemia, hypoglycemia, Graves' disease, goiter, Cushing's disease, Addison's disease,
glucose-galactose malabsorption syndrome, glycogen storage disease, hypercholesterolemia,
adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn syndrome, von Gierke
disease, pseudohypoaldosteronism type 1, Liddle's syndrome, cystinuria, iminoglycinuria, Hartnup
disease, Fanconi disease, and Bartter syndrome; a neurological disorder such as epilepsy, ischemic
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease,
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders,
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess,
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system
disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker
syndrome, fatal familial insomnia, nutritional and metabolic diseases of the
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis,
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and
other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and
polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic
paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, seasonal affective
disorder (SAD), akathisia, amnesia, catatonia, diabetic neuropathy, hemiplegic migraine, tardive
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive
supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a muscle
disorder such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's muscular
dystrophy, myotonic dystrophy, central core disease, nemaline myopathy, centronuclear myopathy,
lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis, dermatomyositis,
inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, anaphylactic shock,
arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension, hypoglycemia,
myocardial infarction, migraine, pheochromocytoma, and myopathies including encephalopathy,
epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, ophthalmoplegia, acid
maltase deficiency (AMD, also known as Pompe's disease), generalized myotonia, and myotonia
congenita; an immunological disorder such as acquired immunodeficiency syndrome (AIDS),
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis,
amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune
thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED),
bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis,
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis
fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout,
Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple
sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis,
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma,
Sjögren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis,
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer,
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and
helminthic infections, and trauma; and a cell proliferative disorder such as actinic keratosis,
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease
(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis,
primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma,
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone,
bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver,
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis,
thymus, thyroid, and uterus. The polynucleotide sequences encoding TRICH may be used in
Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR
technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing
fluids or tissues from patients to detect altered TRICH expression. Such qualitative or quantitative
methods are well known in the art.

In a particular aspect, the nucleotide sequences encoding TRICH may be useful in assays
that detect the presence of associated disorders, particularly those mentioned above. The nucleotide
sequences encoding TRICH may be labeled by standard methods and added to a fluid or tissue
sample from a patient under conditions suitable for the formation of hybridization complexes. After
a suitable incubation period, the sample is washed and the signal is quantified and compared with a
standard value. If the amount of signal in the patient sample is significantly altered in comparison
to a control sample, then the presence of altered levels of nucleotide sequences encoding TRICH in
the sample indicates the presence of the associated disorder. Such assays may also be used to
evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical
trials, or to monitor the treatment of an individual patient.

In order to provide a basis for the diagnosis of a disorder associated with expression of
TRICH, a normal or standard profile for expression is established. This may be accomplished by
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a
sequence, or a fragment thereof, encoding TRICH, under conditions suitable for hybridization or
amplification. Standard hybridization may be quantified by comparing the values obtained from
normal subjects with values from an experiment in which a known amount of a substantially
purified polynucleotide is used. Standard values obtained in this manner may be compared with
values obtained from samples from patients who are symptomatic for a disorder. Deviation from
standard values is used to establish the presence of a disorder.

Once the presence of a disorder is established and a treatment protocol is initiated,
hybridization assays may be repeated on a regular basis to determine if the level of expression in the
patient begins to approximate that which is observed in the normal subject. The results obtained
from successive assays may be used to show the efficacy of treatment over a period ranging from
several days to months.

With respect to cancer, the presence of an abnormal amount of transcript (either under- or
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the
development of the disease, or may provide a means for detecting the disease prior to the
appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow health
professionals to employ preventative measures or aggressive treatment earlier, thereby preventing
the development or further progression of the cancer.

Additional diagnostic uses for oligonucleotides designed from the sequences encoding
TRICH may involve the use of PCR. These oligomers may be chemically synthesized, generated
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a
polynucleotide encoding TRICH, or a fragment of a polynucleotide complementary to the
polynucleotide encoding TRICH, and will be employed under optimized conditions for
identification of a specific gene or condition. Oligomers may also be employed under less stringent
conditions for detection or quantification of closely related DNA or RNA sequences.

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences
encoding TRICH may be used to detect single nucleotide polymorphisms (SNPs). SNPs are
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic
disease in humans. Methods of SNP detection include, but are not limited to, single-stranded
conformation polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP,
oligonucleotide primers derived from the polynucleotide sequences encoding TRICH are used to
amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example,
from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause
differences in the secondary and tertiary structures of PCR products in single-stranded form, and
these differences are detectable using gel electrophoresis in non-denaturing gels. In fSSCP, the
oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in
high-throughput equipment such as DNA sequencing machines. Additionally, sequence database
analysis methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by
comparing the sequence of individual overlapping DNA fragments which assemble into a common
consensus sequence. These computer-based methods filter out sequence variations due to
laboratory preparation of DNA and sequencing errors using statistical models and automated
analyses of DNA sequence chromatograms. In the alternative, SNPs may be detected and
characterized by mass spectrometry using, for example, the high throughput MASSARRAY system
(Sequenom, Inc., San Diego CA).
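
The idea behind in silico SNP detection can be illustrated with a deliberately simplified sketch.
It assumes the overlapping fragments have already been aligned to common coordinates and replaces
the statistical models and chromatogram analysis mentioned above with a crude stand-in: a variant
is reported only if the minor base is seen in at least a minimum number of fragments. The function
name, the threshold, and the reads are hypothetical.

```python
from collections import Counter

def candidate_snps(aligned_reads, min_minor_reads=2):
    """Return (position, consensus_base, minor_base) for well-supported variant columns.

    aligned_reads: equal-length strings over A/C/G/T, with any other character
    (for example '-' or 'N') ignored in that column.
    min_minor_reads: assumed support threshold used to discard isolated errors.
    """
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("reads must be aligned to the same length")
    sites = []
    for pos in range(length):
        counts = Counter(read[pos] for read in aligned_reads if read[pos] in "ACGT")
        if len(counts) < 2:
            continue  # column is unanimous or uninformative
        (major, _), (minor, minor_count) = counts.most_common(2)
        if minor_count >= min_minor_reads:
            sites.append((pos, major, minor))
    return sites

# Hypothetical aligned fragments: position 2 varies in two reads, position 4 in one.
reads = ["ACGTA", "ACGTA", "ACCTA", "ACCTA", "ACGTT"]
print(candidate_snps(reads))
```

The single discordant base at the last position is dropped as a likely sequencing error, while
the variant supported by two fragments is retained.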

Methods which may also be used to quantify the expression of TRICH include radiolabeling
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C.
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide
of interest is presented in various dilutions and a spectrophotometric or colorimetric response gives
rapid quantitation.
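
Interpolation from a standard curve, as mentioned above, can be sketched in a few lines. The
example assumes a monotonic curve built from known amounts and their measured responses and uses
piecewise-linear interpolation; the function name, units, and data points are illustrative
assumptions only.

```python
def interpolate_from_standard_curve(standards, signal):
    """Estimate an unknown amount from its signal using a standard curve.

    standards: list of (known_amount, measured_signal) pairs.
    Uses linear interpolation between the two standards that bracket the signal.
    """
    points = sorted(standards, key=lambda point: point[1])
    for (a_lo, s_lo), (a_hi, s_hi) in zip(points, points[1:]):
        if s_lo <= signal <= s_hi and s_hi > s_lo:
            return a_lo + (signal - s_lo) * (a_hi - a_lo) / (s_hi - s_lo)
    raise ValueError("signal lies outside the standard curve")

# Hypothetical standards: amount in ng versus absorbance.
curve = [(0, 0.0), (10, 0.5), (20, 1.0)]
print(interpolate_from_standard_curve(curve, 0.75))
```

Signals outside the range of the standards are rejected rather than extrapolated.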

In further embodiments, oligonucleotides or longer fragments derived from any of the
polynucleotide sequences described herein may be used as elements on a microarray. The
microarray can be used in transcript imaging techniques which monitor the relative expression
levels of large numbers of genes simultaneously as described below. The microarray may also be
used to identify genetic variants, mutations, and polymorphisms. This information may be used to
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to
monitor progression/regression of disease as a function of gene expression, and to develop and
monitor the activities of therapeutic agents in the treatment of disease. In particular, this
information may be used to develop a pharmacogenomic profile of a patient in order to select the
most appropriate and effective treatment regimen for that patient. For example, therapeutic agents
which are highly effective and display the fewest side effects may be selected for a patient based on
his/her pharmacogenomic profile.

In another embodiment, TRICH, fragments of TRICH, or antibodies specific for TRICH
may be used as elements on a microarray. The microarray may be used to monitor or measure
protein-protein interactions, drug-target interactions, and gene expression profiles, as described
above.

A particular embodiment relates to the use of the polynucleotides of the present invention to
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern
of gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed
by quantifying the number of expressed genes and their relative abundance under given conditions
and at a given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent
No. 5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be
generated by hybridizing the polynucleotides of the present invention or their complements to the
totality of transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment,
the hybridization takes place in high-throughput format, wherein the polynucleotides of the present
invention or their complements comprise a subset of a plurality of elements on a microarray. The
resultant transcript image would provide a profile of gene activity.
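
As a rough illustration, a transcript image can be represented as the number of expressed genes
together with the relative abundance of each. The sketch below assumes that per-gene signal or
count values are already available; the gene labels and numbers are hypothetical, and the
function is not the method of the cited patent.

```python
def transcript_image(counts):
    """Summarize expression as the number of expressed genes and their relative abundance.

    counts: mapping of gene label -> non-negative signal or transcript count.
    """
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("no expression detected")
    expressed = {gene: n for gene, n in counts.items() if n > 0}
    return {
        "genes_expressed": len(expressed),
        "relative_abundance": {gene: n / total for gene, n in expressed.items()},
    }

# Hypothetical counts for one tissue under one condition.
image = transcript_image({"gene_A": 30, "gene_B": 10, "gene_C": 60, "gene_D": 0})
print(image["genes_expressed"], image["relative_abundance"])
```

Images computed for different tissues, conditions, or time points can then be compared gene by
gene.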

Transcript images may be generated using transcripts isolated from tissues, cell lines,
biopsies, or other biological samples. The transcript image may thus reflect gene expression in
vivo, as in the case of a tissue or biopsy sample, or in vitro, as in the case of a cell line.

Transcript images which profile the expression of the polynucleotides of the present
invention may also be used in conjunction with in vitro model systems and preclinical evaluation of
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental
compounds. All compounds induce characteristic gene expression patterns, frequently termed
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and
toxicity (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson
(2000) Toxicol. Lett. 112-113:467-471, expressly incorporated by reference herein). If a test
compound has a signature similar to that of a compound with known toxicity, it is likely to share
those toxic properties. These fingerprints or signatures are most useful and refined when they
contain expression information from a large number of genes and gene families. Ideally, a
genome-wide measurement of expression provides the highest quality signature. Even genes whose
expression is not altered by any tested compounds are important as well, as the levels of expression
of these genes are used to normalize the rest of the expression data. The normalization procedure is
useful for comparison of expression data after treatment with different compounds. While the
assignment of gene function to elements of a toxicant signature aids in interpretation of toxicity
mechanisms, knowledge of gene function is not necessary for the statistical matching of signatures
which leads to prediction of toxicity. (See, for example, Press Release 00-02 from the National
Institute of Environmental Health Sciences, released February 29, 2000, available at
http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in
toxicological screening using toxicant signatures to include all expressed gene sequences.
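
The normalization and signature matching described above might be sketched as follows. This
example assumes that a set of genes unaffected by treatment is known in advance, expresses each
signature as log2 treated/untreated ratios after normalization, and uses Pearson correlation as
the similarity measure. The choice of correlation, the gene labels, and the values are
assumptions made for illustration; the text does not name a specific statistic.

```python
import math

def normalize(profile, invariant_genes):
    """Scale every expression level by the mean level of the invariant genes."""
    reference = sum(profile[g] for g in invariant_genes) / len(invariant_genes)
    if reference <= 0:
        raise ValueError("invariant genes must have positive expression")
    return {gene: level / reference for gene, level in profile.items()}

def signature(treated, untreated, invariant_genes):
    """Log2 treated/untreated ratio for each non-invariant gene, after normalization."""
    t = normalize(treated, invariant_genes)
    u = normalize(untreated, invariant_genes)
    return {g: math.log2(t[g] / u[g]) for g in t if g not in invariant_genes}

def similarity(sig_a, sig_b):
    """Pearson correlation over the genes shared by two signatures."""
    genes = sorted(set(sig_a) & set(sig_b))
    xs = [sig_a[g] for g in genes]
    ys = [sig_b[g] for g in genes]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("a signature with no variation cannot be correlated")
    return cov / math.sqrt(var_x * var_y)

# Hypothetical data: "hk" stands in for a gene not altered by any tested compound.
untreated = {"g1": 100, "g2": 100, "g3": 100, "hk": 50}
known_toxicant = {"g1": 400, "g2": 100, "g3": 50, "hk": 100}
test_compound = {"g1": 200, "g2": 50, "g3": 25, "hk": 50}
sig_known = signature(known_toxicant, untreated, ["hk"])
sig_test = signature(test_compound, untreated, ["hk"])
print(similarity(sig_known, sig_test))
```

Here the two treated samples differ two-fold in overall signal, yet after normalization their
signatures coincide, which is the purpose of the invariant genes.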

In one embodiment, the toxicity of a test compound is assessed by treating a biological
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the
treated biological sample are hybridized with one or more probes specific to the polynucleotides of
the present invention, so that transcript levels corresponding to the polynucleotides of the present
invention may be quantified. The transcript levels in the treated biological sample are compared
with levels in an untreated biological sample. Differences in the transcript levels between the two
samples are indicative of a toxic response caused by the test compound in the treated sample.

Another particular embodiment relates to the use of the polypeptide sequences of the
present invention to analyze the proteome of a tissue or cell type. The term proteome refers to the
global pattern of protein expression in a particular tissue or cell type. Each protein component of a
proteome can be subjected individually to further analysis. Proteome expression patterns, or
profiles, are analyzed by quantifying the number of expressed proteins and their relative abundance
under given conditions and at a given time. A profile of a cell's proteome may thus be generated by
separating and analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the
separation is achieved using two-dimensional gel electrophoresis, in which proteins from a sample
are separated by isoelectric focusing in the first dimension, and then according to molecular weight
by sodium dodecyl sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson,
supra). The proteins are visualized in the gel as discrete and uniquely positioned spots, typically by
staining the gel with an agent such as Coomassie Blue or silver or fluorescent stains. The optical
density of each protein spot is generally proportional to the level of the protein in the sample. The
optical densities of equivalently positioned protein spots from different samples, for example, from
biological samples either treated or untreated with a test compound or therapeutic agent, are
compared to identify any changes in protein spot density related to the treatment. The proteins in
the spots are partially sequenced using, for example, standard methods employing chemical or
enzymatic cleavage followed by mass spectrometry. The identity of the protein in a spot may be
determined by comparing its partial sequence, preferably of at least 5 contiguous amino acid
residues, to the polypeptide sequences of the present invention. In some cases, further sequence
data may be obtained for definitive protein identification.
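
The final identification step, comparing a partial sequence of at least 5 contiguous residues
with known polypeptide sequences, can be illustrated with an exact-match search. The reference
names and sequences below are invented placeholders, not SEQ ID entries, and exact substring
matching is an assumed simplification of the comparison.

```python
def identify_protein(partial_sequence, reference_sequences, min_length=5):
    """Return the names of reference polypeptides containing the partial sequence.

    partial_sequence: contiguous residues read from a gel spot (one-letter codes).
    reference_sequences: mapping of name -> full polypeptide sequence.
    min_length: shortest partial sequence accepted (5, following the text).
    """
    if len(partial_sequence) < min_length:
        raise ValueError("partial sequence is too short to be informative")
    query = partial_sequence.upper()
    return [name for name, seq in reference_sequences.items() if query in seq.upper()]

# Hypothetical reference polypeptides.
references = {
    "polypeptide_A": "MKTLLVAGGSHPQ",
    "polypeptide_B": "MSTNPKPQRK",
}
print(identify_protein("LVAGG", references))
```

A short peptide may match more than one polypeptide, which is one reason further sequence data
may be needed for a definitive identification.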



WO 02/46415    PCT/US01/46963



A proteomic profile may also be generated using antibodies specific for TRICH to quantify
the levels of TRICH expression. In one embodiment, the antibodies are used as elements on a
microarray, and protein expression levels are quantified by exposing the microarray to the sample
and detecting the levels of protein bound to each array element (Lueking, A. et al. (1999) Anal.
Biochem. 270:103-111; Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be
performed by a variety of methods known in the art, for example, by reacting the proteins in the
sample with a thiol- or amino-reactive fluorescent compound and detecting the amount of
fluorescence bound at each array element.

Toxicant signatures at the proteome level are also useful for toxicological screening, and
should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor
correlation between transcript and protein abundances for some proteins in some tissues (Anderson,
N.L. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be
useful in the analysis of compounds which do not significantly affect the transcript image, but
which alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult,
due to rapid degradation of mRNA, so proteomic profiling may be more reliable and informative in
such cases.

In another embodiment, the toxicity of a test compound is assessed by treating a biological
sample containing proteins with the test compound. Proteins that are expressed in the treated
biological sample are separated so that the amount of each protein can be quantified. The amount of
each protein is compared to the amount of the corresponding protein in an untreated biological
sample. A difference in the amount of protein between the two samples is indicative of a toxic
response to the test compound in the treated sample. Individual proteins are identified by
sequencing the amino acid residues of the individual proteins and comparing these partial sequences
to the polypeptides of the present invention.

In another embodiment, the toxicity of a test compound is assessed by treating a biological
sample containing proteins with the test compound. Proteins from the biological sample are
incubated with antibodies specific to the polypeptides of the present invention. The amount of
protein recognized by the antibodies is quantified. The amount of protein in the treated biological
sample is compared with the amount in an untreated biological sample. A difference in the amount
of protein between the two samples is indicative of a toxic response to the test compound in the
treated sample.
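
By way of illustration only, the treated-versus-untreated comparison described in the two preceding embodiments can be sketched in Python as follows. The function name, the dictionary layout, and the 2-fold cutoff are assumptions introduced for this example; the text states only that a difference in the amount of protein is indicative of a toxic response and does not specify a threshold.

```python
def changed_proteins(treated, untreated, min_fold=2.0):
    """Return {protein: fold_change} for proteins whose amount differs
    by at least `min_fold` (up or down) between the two samples.

    `treated` and `untreated` map a protein identifier to a measured
    amount (for example a spot optical density or an antibody signal).
    The 2-fold default is a hypothetical cutoff, not a value from the text.
    """
    flagged = {}
    # Only proteins quantified in both samples can be compared.
    for protein in treated.keys() & untreated.keys():
        t, u = treated[protein], untreated[protein]
        if t <= 0 or u <= 0:
            continue  # skip measurements that cannot form a ratio
        fold = t / u
        if fold >= min_fold or fold <= 1.0 / min_fold:
            flagged[protein] = fold
    return flagged
```

A fold change above 1 indicates a higher amount in the treated sample, and a value below 1 indicates a lower amount.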

Microarrays may be prepared, used, and analyzed using methods known in the art. (See,
e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl.
Acad. Sci. USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116;
Shalon, D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad.
Sci. USA 94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of
microarrays are well known and thoroughly described in DNA Microarrays: A Practical Approach,
M. Schena, ed. (1999) Oxford University Press, London, hereby expressly incorporated by
reference.

In another embodiment of the invention, nucleic acid sequences encoding TRICH may be
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence.
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences
may be preferable over coding sequences. For example, conservation of a coding sequence among
members of a multi-gene family may potentially cause undesired cross hybridization during
chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial
chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes
(BACs), bacterial P1 constructions, or single chromosome cDNA libraries. (See, e.g., Harrington,
J.J. et al. (1997) Nat. Genet. 15:345-355; Price, C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J.
(1991) Trends Genet. 7:149-154.) Once mapped, the nucleic acid sequences of the invention may
be used to develop genetic linkage maps, for example, which correlate the inheritance of a disease
state with the inheritance of a particular chromosome region or restriction fragment length
polymorphism (RFLP). (See, for example, Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad.
Sci. USA 83:7353-7357.)

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic
map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968.) Examples of
genetic map data can be found in various scientific journals or at the Online Mendelian Inheritance
in Man (OMIM) World Wide Web site. Correlation between the location of the gene encoding
TRICH on a physical map and a specific disorder, or a predisposition to a specific disorder, may
help define the region of DNA associated with that disorder and thus may further positional cloning
efforts.

In situ hybridization of chromosomal preparations and physical mapping techniques, such as
linkage analysis using established chromosomal markers, may be used for extending genetic maps.
Often the placement of a gene on the chromosome of another mammalian species, such as mouse,
may reveal associated markers even if the exact chromosomal locus is not known. This information
is valuable to investigators searching for disease genes using positional cloning or other gene
discovery techniques. Once the gene or genes responsible for a disease or syndrome have been
crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to
11q22-23, any sequences mapping to that area may represent associated or regulatory genes for
further investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide
sequence of the instant invention may also be used to detect differences in the chromosomal
location due to translocation, inversion, etc., among normal, carrier, or affected individuals.

In another embodiment of the invention, TRICH, its catalytic or immunogenic fragments, or
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug
screening techniques. The fragment employed in such screening may be free in solution, affixed to
a solid support, borne on a cell surface, or located intracellularly. The formation of binding
complexes between TRICH and the agent being tested may be measured.

Another technique for drug screening provides for high throughput screening of compounds
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT
application WO84/03564.) In this method, large numbers of different small test compounds are
synthesized on a solid substrate. The test compounds are reacted with TRICH, or fragments thereof,
and washed. Bound TRICH is then detected by methods well known in the art. Purified TRICH can
also be coated directly onto plates for use in the aforementioned drug screening techniques.
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a
solid support.

In another embodiment, one may use competitive drug screening assays in which
neutralizing antibodies capable of binding TRICH specifically compete with a test compound for
binding TRICH. In this manner, antibodies can be used to detect the presence of any peptide which
shares one or more antigenic determinants with TRICH.

In additional embodiments, the nucleotide sequences which encode TRICH may be used in
any molecular biology techniques that have yet to be developed, provided the new techniques rely
on properties of nucleotide sequences that are currently known, including, but not limited to, such
properties as the triplet genetic code and specific base pair interactions.

Without further elaboration, it is believed that one skilled in the art can, using the preceding
description, utilize the present invention to its fullest extent. The following embodiments are,
therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure
in any way whatsoever.

The disclosures of all patents, applications and publications, mentioned above and below,
including U.S. Ser. No. 60/254,303, U.S. Ser. No. 60/256,190, U.S. Ser. No. 60/257^04, U.S. Ser.
No. 60/261,546, U.S. Ser. No. 60/262,832, U.S. Ser. No. 60/264,377, and U.S. Ser. No. 60/266,019,
are expressly incorporated by reference herein.

EXAMPLES 

I. Construction of cDNA Libraries
Incyte cDNAs were derived from cDNA libraries described in the LIFESEQ GOLD
database (Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in
guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a suitable
mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of phenol and
guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted
with chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate
and ethanol, or by other routine methods.

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles
(QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively,
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the
POLY(A)PURE mRNA purification kit (Ambion, Austin TX).

In some cases, Stratagene was provided with RNA and constructed the corresponding
cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the
UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using
the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra,
units 5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with
the appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected
(300-1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis.
cDNAs were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid,
e.g., PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), PCDNA2.1
plasmid (Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), PCR2-TOPOTA plasmid
(Invitrogen), PCMV-ICIS plasmid (Stratagene), pIGEN (Incyte Genomics, Palo Alto CA), pRARE
(Incyte Genomics), or pINCY (Incyte Genomics), or derivatives thereof. Recombinant plasmids
were transformed into competent E. coli cells including XL1-Blue, XL1-BlueMRF, or SOLR from
Stratagene or DH5α, DH10B, or ElectroMAX DH10B from Life Technologies.
II. Isolation of cDNA Clones
Plasmids obtained as described in Example I were recovered from host cells by in vivo
excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified
using at least one of the following: a Magic or WIZARD Minipreps DNA purification system
(Promega); an AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and
QIAWELL 8 Plasmid, QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems
or the R.E.A.L. PREP 96 plasmid purification kit from QIAGEN. Following precipitation, plasmids
were resuspended in 0.1 ml of distilled water and stored, with or without lyophilization, at 4°C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence
scanner (Labsystems Oy, Helsinki, Finland).
III. Sequencing and Analysis

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows.
Sequencing reactions were processed using standard methods or high-throughput instrumentation
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal
cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such
as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems).
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides
were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI
protocols and base calling software; or other sequence analysis systems known in the art. Reading
frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel,
1997, supra, unit 7.7). Some of the cDNA sequences were selected for extension using the
techniques disclosed in Example VIII.

The polynucleotide sequences derived from Incyte cDNAs were validated by removing
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and
programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis.
The Incyte cDNA sequences or translations thereof were then queried against a selection of public
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases,
and BLOCKS, PRINTS, DOMO, PRODOM; PROTEOME databases with sequences from Homo
sapiens, Rattus norvegicus, Mus musculus, Caenorhabditis elegans, Saccharomyces cerevisiae,
Schizosaccharomyces pombe, and Candida albicans (Incyte Genomics, Palo Alto CA); and hidden
Markov model (HMM)-based protein family databases such as PFAM. (HMM is a probabilistic
approach which analyzes consensus primary structures of gene families. See, for example, Eddy,
S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) The queries were performed using programs
based on BLAST, FASTA, BLIMPS, and HMMER. The Incyte cDNA sequences were assembled
to produce full length polynucleotide sequences. Alternatively, GenBank cDNAs, GenBank ESTs,
stitched sequences, stretched sequences, or Genscan-predicted coding sequences (see Examples IV
and V) were used to extend Incyte cDNA assemblages to full length. Assembly was performed
using programs based on Phred, Phrap, and Consed, and cDNA assemblages were screened for open
reading frames using programs based on GeneMark, BLAST, and FASTA. The full length
polynucleotide sequences were translated to derive the corresponding full length polypeptide
sequences. Alternatively, a polypeptide of the invention may begin at any of the methionine
residues of the full length translated polypeptide. Full length polypeptide sequences were
subsequently analyzed by querying against databases such as the GenBank protein databases
(genpept), SwissProt, the PROTEOME databases, BLOCKS, PRINTS, DOMO, PRODOM, Prosite,
and hidden Markov model (HMM)-based protein family databases such as PFAM. Full length
polynucleotide sequences are also analyzed using MACDNASIS PRO software (Hitachi Software
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide
and polypeptide sequence alignments are generated using default parameters specified by the
CLUSTAL algorithm as incorporated into the MEGALIGN multisequence alignment program
(DNASTAR), which also calculates the percent identity between aligned sequences.
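
By way of illustration only, the statement above that a polypeptide may begin at any of the methionine residues of the full length translation can be sketched in Python as follows. The function name and the one-letter amino acid string representation are assumptions made for this example and are not part of the described procedure.

```python
def methionine_start_variants(polypeptide):
    """Return every suffix of `polypeptide` that begins at a methionine.

    `polypeptide` is a one-letter amino acid string for the full length
    translation. Variants are returned longest first, in the order the
    methionine (M) residues occur.
    """
    return [polypeptide[i:] for i, aa in enumerate(polypeptide) if aa == "M"]
```

Each returned string corresponds to one candidate polypeptide that starts at an internal or initial methionine and runs to the end of the translation.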

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly
of Incyte cDNA and full length sequences and provides applicable descriptions, references, and
threshold parameters. The first column of Table 7 shows the tools, programs, and algorithms used,
the second column provides brief descriptions thereof, the third column presents appropriate
references, all of which are incorporated by reference herein in their entirety, and the fourth column
presents, where applicable, the scores, probability values, and other parameters used to evaluate the
strength of a match between two sequences (the higher the score or the lower the probability value,
the greater the identity between two sequences).

The programs described above for the assembly and analysis of full length polynucleotide
and polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ
ID NO:33-64. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization
and amplification technologies are described in Table 4, column 2.
IV. Identification and Editing of Coding Sequences from Genomic DNA

Putative transporters and ion channels were initially identified by running the Genscan gene
identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan
is a general-purpose gene identification program which analyzes genomic DNA sequences from a
variety of organisms (See Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94, and Burge, C. and
S. Karlin (1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to
form an assembled cDNA sequence extending from a methionine to a stop codon. The output of
Genscan is a FASTA database of polynucleotide and polypeptide sequences. The maximum range
of sequence for Genscan to analyze at once was set to 30 kb. To determine which of these Genscan
predicted cDNA sequences encode transporters and ion channels, the encoded polypeptides were
analyzed by querying against PFAM models for transporters and ion channels. Potential
transporters and ion channels were also identified by homology to Incyte cDNA sequences that had
been annotated as transporters and ion channels. These selected Genscan-predicted sequences were
then compared by BLAST analysis to the genpept and gbpri public databases. Where necessary, the
Genscan-predicted sequences were then edited by comparison to the top BLAST hit from genpept to
correct errors in the sequence predicted by Genscan, such as extra or omitted exons. BLAST
analysis was also used to find any Incyte cDNA or public cDNA coverage of the Genscan-predicted
sequences, thus providing evidence for transcription. When Incyte cDNA coverage was available,
this information was used to correct or confirm the Genscan predicted sequence. Full length
polynucleotide sequences were obtained by assembling Genscan-predicted coding sequences with
Incyte cDNA sequences and/or public cDNA sequences using the assembly process described in
Example III. Alternatively, full length polynucleotide sequences were derived entirely from edited
or unedited Genscan-predicted coding sequences.

V. Assembly of Genomic Sequence Data with cDNA Sequence Data
"Stitched" Sequences

Partial cDNA sequences were extended with exons predicted by the Genscan gene
identification program described in Example IV. Partial cDNAs assembled as described in Example
III were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan
exon predictions from one or more genomic sequences. Each cluster was analyzed using an
algorithm based on graph theory and dynamic programming to integrate cDNA and genomic
information, generating possible splice variants that were subsequently confirmed, edited, or
extended to create a full length sequence. Sequence intervals in which the entire length of the
interval was present on more than one sequence in the cluster were identified, and intervals thus
identified were considered to be equivalent by transitivity. For example, if an interval was present
on a cDNA and two genomic sequences, then all three intervals were considered to be equivalent.
This process allows unrelated but consecutive genomic sequences to be brought together, bridged by
cDNA sequence. Intervals thus identified were then "stitched" together by the stitching algorithm
in the order that they appear along their parent sequences to generate the longest possible sequence,
as well as sequence variants. Linkages between intervals which proceed along one type of parent
sequence (cDNA to cDNA or genomic sequence to genomic sequence) were given preference over
linkages which change parent type (cDNA to genomic sequence). The resultant stitched sequences
were translated and compared by BLAST analysis to the genpept and gbpri public databases.
Incorrect exons predicted by Genscan were corrected by comparison to the top BLAST hit from
genpept. Sequences were further extended with additional cDNA sequences, or by inspection of
genomic DNA, when necessary.
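
By way of illustration only, the grouping of intervals "considered to be equivalent by transitivity" can be sketched in Python with a union-find structure. This is not the stitching algorithm itself, whose graph-theoretic and dynamic programming details are not given here; the function name and the interval labels in the usage note are hypothetical.

```python
def group_equivalent(pairs):
    """Group intervals into equivalence classes from pairwise matches.

    `pairs` is an iterable of (a, b) tuples, each stating that interval
    `a` and interval `b` cover the same stretch of sequence. Equivalence
    is extended by transitivity using a union-find structure.
    """
    parent = {}

    def find(x):
        # Register unseen items as their own root, then walk to the root.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two classes

    groups = {}
    for item in list(parent):
        groups.setdefault(find(item), set()).add(item)
    return list(groups.values())
```

For instance, if a cDNA interval matches an interval on each of two genomic sequences, the two pairwise matches yield a single class containing all three intervals, mirroring the example given in the text.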
"Stretched" Sequences 

Partial DNA sequences were extended to full length with an algorithm based on BLAST
analysis. First, partial cDNAs assembled as described in Example III were queried against public
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs
(HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions
may occur in the chimeric protein with respect to the original GenBank protein homolog. The
GenBank protein homolog, the chimeric protein, or both were used as probes to search for
homologous genomic sequences from the public human genome databases. Partial DNA sequences
were therefore "stretched" or extended by the addition of homologous genomic sequences. The
resultant stretched sequences were examined to determine whether they contained a complete gene.
VI. Chromosomal Mapping of TRICH Encoding Polynucleotides

The sequences which were used to assemble SEQ ID NO:33-64 were compared with
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched
SEQ ID NO:33-64 were assembled into clusters of contiguous and overlapping sequences using
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for
Genome Research (WIGR), and Généthon were used to determine if any of the clustered sequences
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the
assignment of all sequences of that cluster, including its particular SEQ ID NO:, to that map
location.
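
By way of illustration only, the assignment of a map location to every member of a cluster that contains a previously mapped sequence can be sketched in Python as follows. The function name, the identifiers, and the data layout are assumptions made for this example; the sketch also assumes that all mapped members of one cluster share the same location.

```python
def assign_map_locations(clusters, known_locations):
    """Propagate a known map location to every sequence in its cluster.

    `clusters` maps a cluster id to a list of sequence ids.
    `known_locations` maps previously mapped sequence ids to a location.
    Clusters with no mapped member are left unassigned.
    """
    assigned = {}
    for members in clusters.values():
        mapped = [known_locations[s] for s in members if s in known_locations]
        if not mapped:
            continue  # nothing in this cluster has been mapped
        for seq_id in members:
            # Assumes mapped members agree; the first location is used.
            assigned[seq_id] = mapped[0]
    return assigned
```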

Map locations are represented by ranges, or intervals, of human chromosomes. The map
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's
p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies
between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of
DNA in humans, although this can vary widely due to hot and cold spots of recombination.) The
cM distances are based on genetic markers mapped by Généthon which provide boundaries for
radiation hybrid markers whose sequences were included in each of the clusters. Human genome
maps and other resources available to the public, such as the NCBI "GeneMap'99" World Wide
Web site (http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously
identified disease genes map within or in proximity to the intervals indicated above.

VII. Analysis of Polynucleotide Expression
Northern analysis is a laboratory technique used to detect the presence of a transcript of a
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which
RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7;
Ausubel (1995) supra, ch. 4 and 16.)

Analogous computer techniques applying BLAST were used to search for identical or
related molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This
analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of
the computer search can be modified to determine whether any particular match is categorized as
exact or similar. The basis of the search is the product score, which is defined as:

\[
\text{Product Score} = \frac{\text{BLAST Score} \times \text{Percent Identity}}{5 \times \min\{\text{length(Seq. 1)},\ \text{length(Seq. 2)}\}}
\]

The product score takes into account both the degree of similarity between two sequences and the
length of the sequence match. The product score is a normalized value between 0 and 100, and is
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by
gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to
calculate the product score. The product score represents a balance between fractional overlap and
quality in a BLAST alignment. For example, a product score of 100 is produced only for 100%
identity over the entire length of the shorter of the two sequences being compared. A product score
of 70 is produced either by 100% identity and 70% overlap at one end, or by 88% identity and 100%
overlap at the other. A product score of 50 is produced either by 100% identity and 50% overlap at
one end, or 79% identity and 100% overlap.
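
By way of illustration only, the product score calculation described above can be sketched in Python as follows. The function names and the representation of each HSP as a (matches, mismatches) pair are assumptions made for this example; gaps within an HSP are not modeled.

```python
def blast_score(matches, mismatches):
    """Score an HSP as +5 per matching base and -4 per mismatch."""
    return 5 * matches - 4 * mismatches


def product_score(hsps, len_seq1, len_seq2):
    """Product score computed from the highest-scoring HSP.

    `hsps` is a non-empty list of (matches, mismatches) tuples, one per
    HSP shared by the two sequences; `len_seq1` and `len_seq2` are the
    sequence lengths in nucleotides.
    """
    # When several HSPs exist, the one with the highest BLAST score is used.
    matches, mismatches = max(hsps, key=lambda h: blast_score(*h))
    score = blast_score(matches, mismatches)
    percent_identity = 100.0 * matches / (matches + mismatches)
    return score * percent_identity / (5 * min(len_seq1, len_seq2))
```

Under these assumptions, 100 matching bases over a 100-nucleotide shorter sequence give a score of 100, and 70 matching bases over the same sequence give 70. An HSP with 88 matches and 12 mismatches across the full 100 nucleotides gives approximately 69, and one with 79 matches and 21 mismatches gives approximately 49, close to the rounded values of 70 and 50 stated in the text.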

Alternatively, polynucleotide sequences encoding TRICH are analyzed with respect to the
tissue sources from which they were derived. For example, some full length sequences are
assembled, at least in part, with overlapping Incyte cDNA sequences (see Example III). Each cDNA
sequence is derived from a cDNA library constructed from a human tissue. Each human tissue is
classified into one of the following organ/tissue categories: cardiovascular system; connective
tissue; digestive system; embryonic structures; endocrine system; exocrine glands; genitalia, female;
genitalia, male; germ cells; hemic and immune system; liver; musculoskeletal system; nervous
system; pancreas; respiratory system; sense organs; skin; stomatognathic system;
unclassified/mixed; or urinary tract. The number of libraries in each category is counted and
divided by the total number of libraries across all categories. Similarly, each human tissue is
classified into one of the following disease/condition categories: cancer, cell line, developmental,
inflammation, neurological, trauma, cardiovascular, pooled, and other, and the number of libraries
in each category is counted and divided by the total number of libraries across all categories. The
resulting percentages reflect the tissue- and disease-specific expression of cDNA encoding TRICH.
cDNA sequences and cDNA library/tissue information are found in the LIFESEQ GOLD database
(Incyte Genomics, Palo Alto CA).
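
By way of illustration only, the per-category percentages described above can be computed as in the following Python sketch. The function name and the input format (one category label per library) are assumptions made for this example.

```python
from collections import Counter


def category_percentages(library_categories):
    """Percentage of libraries falling in each category.

    `library_categories` lists one category label per cDNA library,
    for example the organ/tissue category of each library in which
    the sequence was found.
    """
    counts = Counter(library_categories)
    total = sum(counts.values())
    if total == 0:
        return {}
    # Count per category divided by the total across all categories.
    return {cat: 100.0 * n / total for cat, n in counts.items()}
```

The same function applies unchanged to the disease/condition classification, since both calculations divide a per-category library count by the total number of libraries.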

VIII. Extension of TRICH Encoding Polynucleotides

Full length polynucleotide sequences were also produced by extension of an appropriate
fragment of the full length molecule using oligonucleotide primers designed from this fragment.
One primer was synthesized to initiate 5' extension of the known fragment, and the other primer was
synthesized to initiate 3' extension of the known fragment. The initial primers were designed using
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target
sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would
result in hairpin structures and primer-primer dimerizations was avoided.
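
By way of illustration only, the length and GC-content criteria stated above can be checked with the following Python sketch. It is not a substitute for OLIGO 4.06 or any other primer design program: the function names are hypothetical, and annealing temperature, hairpin formation, and primer-primer dimerization are not evaluated.

```python
def gc_fraction(primer):
    """Fraction of G and C bases in a primer (0.0 for an empty string)."""
    seq = primer.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def meets_basic_criteria(primer, min_len=22, max_len=30, min_gc=0.50):
    """Check only the length (about 22 to 30 nt) and GC content
    (about 50% or more) criteria for a candidate primer."""
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc
```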

Selected human cDNA libraries were used to extend the sequence. If more than one
extension was necessary or desired, additional or nested sets of primers were designed.

High fidelity amplification was obtained by PCR using methods well known in the art. PCR
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The
reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+,
(NH4)2SO4, and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech),
ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the
following parameters for primer pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec;
Step 3: 60°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C,
5 min; Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+ were as
follows: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step
5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C.
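
By way of illustration only, the total programmed hold time of a cycling protocol of the kind listed above can be estimated with the following Python sketch. The function name is hypothetical, and the sketch assumes that Steps 2 through 4 are run once and then repeated the stated number of times; temperature ramping and the final 4°C storage hold are not counted.

```python
def program_seconds(initial, cycle_steps, repeats, final_extension):
    """Total programmed hold time in seconds.

    `initial` is the Step 1 hold, `cycle_steps` lists the holds for
    Steps 2 to 4, `repeats` is the number of times those steps are
    repeated after their first pass, and `final_extension` is the
    Step 6 hold. Ramping and the storage hold are excluded.
    """
    return initial + (1 + repeats) * sum(cycle_steps) + final_extension
```

Under that assumption, the protocol with a 3 min initial hold, 15 sec, 1 min, and 2 min cycle steps, 20 repeats, and a 5 min final extension amounts to 4575 seconds, or a little over 76 minutes of programmed hold time.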

The concentration of DNA in each well was determined by dispensing 100 µl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar,
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the
concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture was analyzed by
electrophoresis on a 1% agarose gel to determine which reactions were successful in extending the
sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates,
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%)
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector
(Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction
site overhangs, and transfected into competent E. coli cells. Transformed cells were selected on
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37°C in
384-well plates in LB/2x carb liquid media.

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following
parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min;
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low
DNA recoveries were reamplified using the same conditions as described above. Samples were
diluted with 20% dimethylsulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer
sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems).

In like manner, full length polynucleotide sequences are verified using the above procedure
or are used to obtain 5' regulatory sequences using the above procedure along with oligonucleotides
designed for such extension, and an appropriate genomic library.
IX. Labeling and Use of Individual Hybridization Probes

Hybridization probes derived from SEQ ID NO:33-64 are employed to screen cDNAs,
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base
pairs, is specifically described, essentially the same procedure is used with larger nucleotide
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of
[γ-³²P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech).
An aliquot containing 10⁷ counts per minute of the labeled probe is used in a typical membrane-based
hybridization analysis of human genomic DNA digested with one of the following
endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate.
Hybridization patterns are visualized using autoradiography or an alternative imaging means and
compared.

X. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing
photolithography, piezoelectric printing (ink-jet printing; see, e.g., Baldeschweiler, supra),
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the
aforementioned technologies should be uniform and solid with a non-porous surface (Schena
(1999), supra). Suggested substrates include silicon, silica, glass slides, glass chips, and silicon
wafers. Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and
link elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding
procedures. A typical array may be produced using available methods and machines well known to
those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g.,
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645;
Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.)

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof
may comprise the elements of the microarray. Fragments or oligomers suitable for hybridization
can be selected using software well known in the art such as LASERGENE software (DNASTAR).
The array elements are hybridized with polynucleotides in a biological sample. The polynucleotides
in the biological sample are conjugated to a fluorescent label or other molecular tag for ease of
detection. After hybridization, nonhybridized nucleotides from the biological sample are removed,
and a fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser
desorption and mass spectrometry may be used for detection of hybridization. The degree of
complementarity and the relative abundance of each polynucleotide which hybridizes to an element
on the microarray may be assessed. In one embodiment, microarray preparation and usage is
described in detail below.
Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and
poly(A)⁺ RNA is purified using the oligo-(dT) cellulose method. Each poly(A)⁺ RNA sample is
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/µl oligo-(dT) primer (21mer), 1X
first strand buffer, 0.03 units/µl RNase inhibitor, 500 µM dATP, 500 µM dGTP, 500 µM dTTP, 40
µM dCTP, 40 µM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A)⁺ RNA with
GEMBRIGHT kits (Incyte). Specific control poly(A)⁺ RNAs are synthesized by in vitro
transcription from non-coding yeast genomic DNA. After incubation at 37°C for 2 hr, each reaction
sample (one with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium
hydroxide and incubated for 20 minutes at 85°C to stop the reaction and degrade the RNA.
Samples are purified using two successive CHROMA SPIN 30 gel filtration spin columns
(CLONTECH Laboratories, Inc. (CLONTECH), Palo Alto CA) and after combining, both reaction
samples are ethanol precipitated using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300
ml of 100% ethanol. The sample is then dried to completion using a SpeedVAC (Savant
Instruments Inc., Holbrook NY) and resuspended in 14 µl 5X SSC/0.2% SDS.
Microarray Preparation

Sequences of the present invention are used to generate array elements. Each array element
is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification
uses primers complementary to the vector sequences flanking the cDNA insert. Array elements are
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5
µg. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia
Biotech).
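
The amplification figures above imply a minimum overall yield that can be checked with simple arithmetic. The following sketch is illustrative only: it assumes a constant per-cycle amplification factor, which real PCR does not maintain, and the function names are hypothetical.

```python
import math


def fold_amplification(start_ng: float, final_ug: float) -> float:
    """Overall fold increase in DNA mass (1 microgram = 1000 nanograms)."""
    return (final_ug * 1000.0) / start_ng


def doublings_needed(fold: float) -> float:
    """Number of perfect doublings that would give the same fold increase."""
    return math.log2(fold)


def mean_factor_per_cycle(fold: float, cycles: int = 30) -> float:
    """Geometric-mean amplification factor per cycle over the whole run."""
    return fold ** (1.0 / cycles)


if __name__ == "__main__":
    for start_ng in (1.0, 2.0):
        fold = fold_amplification(start_ng, 5.0)
        print(start_ng, "ng ->", fold, "fold,",
              round(doublings_needed(fold), 1), "doublings,",
              round(mean_factor_per_cycle(fold), 3), "per cycle on average")
```

Reaching 5 µg from 1-2 ng corresponds to roughly a 2,500- to 5,000-fold increase, or about 11 to 12 doublings spread over the thirty cycles.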

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water,
and coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a
110°C oven.

Array elements are applied to the coated glass substrate using a procedure described in U.S.
Patent No. 5,807,522, incorporated herein by reference. 1 µl of the array element DNA, at an
average concentration of 100 ng/µl, is loaded into the open capillary printing element by a high-speed
robotic apparatus. The apparatus then deposits about 5 nl of array element sample per slide.
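
The printing quantities above can be turned into a quick per-spot estimate. This is a back-of-the-envelope sketch under the stated average values (100 ng/µl, 1 µl loaded, about 5 nl deposited per slide); it ignores dead volume and evaporation, and the function names are hypothetical.

```python
def dna_per_spot_ng(conc_ng_per_ul: float = 100.0, spot_nl: float = 5.0) -> float:
    """Mass of DNA deposited in one spot (1 microliter = 1000 nanoliters)."""
    return conc_ng_per_ul * spot_nl / 1000.0


def max_slides_per_load(load_ul: float = 1.0, spot_nl: float = 5.0) -> int:
    """Upper bound on slides printed from one capillary load, ignoring losses."""
    return int(load_ul * 1000.0 // spot_nl)


if __name__ == "__main__":
    print(dna_per_spot_ng(), "ng per spot;",
          max_slides_per_load(), "slides per load at most")
```

With the default values each spot receives about 0.5 ng of DNA, and one load could in principle cover up to 200 slides.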

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene).
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water.
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60°C followed by washes in
0.2% SDS and distilled water as before.

Hybridization
Hybridization reactions contain 9 µl of sample mixture consisting of 0.2 µg each of Cy3 and
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample
mixture is heated to 65°C for 5 minutes and is aliquoted onto the microarray surface and covered
with a 1.8 cm² coverslip. The arrays are transferred to a waterproof chamber having a cavity just
slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the
addition of 140 µl of 5X SSC in a corner of the chamber. The chamber containing the arrays is
incubated for about 6.5 hours at 60°C. The arrays are washed for 10 min at 45°C in a first wash
buffer (1X SSC, 0.1% SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X
SSC), and dried.

Detection

Reporter-labeled hybridization complexes are detected with a microscope equipped with an
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral
lines at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light
is focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster-scanned
past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with
a resolution of 20 micrometers.
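
The scan geometry above fixes the size of the resulting image. A minimal sketch of that arithmetic, assuming one pixel per 20-micrometer step along each axis (the function names are hypothetical):

```python
def pixels_per_side(side_cm: float = 1.8, resolution_um: int = 20) -> int:
    """Number of scan positions along one edge of the square array."""
    side_um = round(side_cm * 10_000)  # 1 cm = 10,000 micrometers
    return side_um // resolution_um


def total_pixels(side_cm: float = 1.8, resolution_um: int = 20) -> int:
    """Total pixels in one scan of the square array."""
    return pixels_per_side(side_cm, resolution_um) ** 2


if __name__ == "__main__":
    print(pixels_per_side(), "x", pixels_per_side(), "=", total_pixels(), "pixels per scan")
```

For the 1.8 cm array this gives a 900 x 900 image, so each fluorophore scan produces 810,000 intensity readings.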

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially.
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477,
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores.
Appropriate filters positioned between the array and the photomultiplier tubes are used to filter the
signals. The emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5.
Each array is typically scanned twice, one scan per fluorophore using the appropriate filters at the
laser source, although the apparatus is capable of recording the spectra from both fluorophores
simultaneously.

The sensitivity of the scans is typically calibrated using the signal intensity generated by a
cDNA control species added to the sample mixture at a known concentration. A specific location
on the array contains a complementary DNA sequence, allowing the intensity of the signal at that
location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two
samples from different sources (e.g., representing test and control cells), each labeled with a
different fluorophore, are hybridized to a single array for the purpose of identifying genes that are
differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with
the two fluorophores and adding identical amounts of each to the hybridization mixture.

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible
PC computer. The digitized data are displayed as an image where the signal intensity is
mapped using a linear 20-color transformation to a pseudocolor scale ranging from blue (low
signal) to red (high signal). The data are also analyzed quantitatively. Where two different
fluorophores are excited and measured simultaneously, the data are first corrected for optical
crosstalk (due to overlapping emission spectra) between the fluorophores using each fluorophore's
emission spectrum.
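
The display mapping and crosstalk correction described above can be sketched as follows. This is an assumed illustration, not the scanner software: the 12-bit range (0 to 4095) is binned linearly into 20 color indices, and crosstalk is modeled as a simple two-channel linear mixture whose bleed-through fractions (`cy3_into_cy5`, `cy5_into_cy3`) are hypothetical parameters that would have to be measured from each fluorophore's emission spectrum.

```python
ADC_LEVELS = 2 ** 12  # a 12-bit converter reports counts 0..4095
N_COLORS = 20         # linear 20-color pseudocolor scale, 0 = blue ... 19 = red


def pseudocolor_index(count: int) -> int:
    """Map a digitized count linearly onto one of the 20 color bins."""
    if not 0 <= count < ADC_LEVELS:
        raise ValueError("count outside the 12-bit range")
    return count * N_COLORS // ADC_LEVELS


def correct_crosstalk(m_cy3: float, m_cy5: float,
                      cy3_into_cy5: float, cy5_into_cy3: float):
    """Invert the assumed mixing model
         m_cy3 = s_cy3 + cy5_into_cy3 * s_cy5
         m_cy5 = cy3_into_cy5 * s_cy3 + s_cy5
    and return the unmixed signals (s_cy3, s_cy5)."""
    det = 1.0 - cy3_into_cy5 * cy5_into_cy3
    if det == 0.0:
        raise ValueError("channels are not separable with these fractions")
    s_cy3 = (m_cy3 - cy5_into_cy3 * m_cy5) / det
    s_cy5 = (m_cy5 - cy3_into_cy5 * m_cy3) / det
    return s_cy3, s_cy5


if __name__ == "__main__":
    print(pseudocolor_index(0), pseudocolor_index(2048), pseudocolor_index(4095))
    print(correct_crosstalk(101.0, 60.0, cy3_into_cy5=0.1, cy5_into_cy3=0.02))
```

A linear model is the simplest choice consistent with the description; a real instrument may need a calibration that also accounts for detector offsets and nonlinearity.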

A grid is superimposed over the fluorescence signal image such that the signal from each
spot is centered in each element of the grid. The fluorescence signal within each element is then
integrated to obtain a numerical value corresponding to the average intensity of the signal. The
software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte).
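
The grid integration step can be illustrated with a short sketch. It is not the GEMTOOLS implementation; it simply assumes a square grid of fixed cell size laid over an image stored as a list of pixel rows, and returns the mean intensity in each cell. The function name and data layout are hypothetical.

```python
def grid_mean_intensities(image, cell):
    """Average pixel intensity inside each square grid element.

    image: list of equal-length rows of numeric pixel values
    cell:  side length of one grid element, in pixels
    """
    rows, cols = len(image), len(image[0])
    if rows % cell or cols % cell:
        raise ValueError("image size must be a multiple of the cell size")
    means = []
    for r0 in range(0, rows, cell):
        row_means = []
        for c0 in range(0, cols, cell):
            total = sum(image[r][c]
                        for r in range(r0, r0 + cell)
                        for c in range(c0, c0 + cell))
            row_means.append(total / (cell * cell))
        means.append(row_means)
    return means


if __name__ == "__main__":
    demo = [[1, 1, 4, 4],
            [1, 1, 4, 4],
            [0, 2, 8, 8],
            [2, 0, 8, 8]]
    print(grid_mean_intensities(demo, 2))
```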
XI. Complementary Polynucleotides

Sequences complementary to the TRICH-encoding sequences, or any parts thereof, are used
to detect, decrease, or inhibit expression of naturally occurring TRICH. Although use of
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of TRICH.
To inhibit transcription, a complementary oligonucleotide is designed from the most unique 5'
sequence and used to prevent promoter binding to the coding sequence. To inhibit translation, a
complementary oligonucleotide is designed to prevent ribosomal binding to the TRICH-encoding
transcript.
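
The core operation behind designing a complementary oligonucleotide is taking the reverse complement of a chosen target window. The sketch below shows only that step; it is not the OLIGO 4.06 procedure, it performs no uniqueness or melting-temperature checks, and the function names and example sequence are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    if set(seq.upper()) - set("ACGT"):
        raise ValueError("sequence contains characters other than A, C, G, T")
    return seq.translate(_COMPLEMENT)[::-1]


def antisense_oligo(coding_seq: str, start: int, length: int = 20) -> str:
    """Oligonucleotide complementary to coding_seq[start:start + length]."""
    target = coding_seq[start:start + length]
    if len(target) < length:
        raise ValueError("target window runs past the end of the sequence")
    return reverse_complement(target)


if __name__ == "__main__":
    print(antisense_oligo("ATGGCCAAGT", start=0, length=4))
```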

XII. Expression of TRICH

Expression and purification of TRICH is achieved using bacterial or virus-based expression
systems. For expression of TRICH in bacteria, cDNA is subcloned into an appropriate vector
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3).
Antibiotic resistant bacteria express TRICH upon induction with isopropyl beta-D-thiogalactopyranoside
(IPTG). Expression of TRICH in eukaryotic cells is achieved by infecting
insect or mammalian cell lines with recombinant Autographa californica nuclear polyhedrosis
virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of
baculovirus is replaced with cDNA encoding TRICH by either homologous recombination or
bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is
maintained and the strong polyhedrin promoter drives high levels of cDNA transcription.
Recombinant baculovirus is used to infect Spodoptera frugiperda (Sf9) insect cells in most cases, or
human hepatocytes, in some cases. Infection of the latter requires additional genetic modifications
to baculovirus. (See Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227;
Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.)

In most expression systems, TRICH is synthesized as a fusion protein with, e.g., glutathione
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step,
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton
enzyme from Schistosoma japonicum, enables the purification of fusion proteins on
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from
TRICH at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity
purification using commercially available monoclonal and polyclonal anti-FLAG antibodies
(Eastman Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on
metal-chelate resins (QIAGEN). Methods for protein expression and purification are discussed in
Ausubel (1995, supra, ch. 10 and 16). Purified TRICH obtained by these methods can be used
directly in the assays shown in Examples XVI, XVII, and XVIII where applicable.
XIII. Functional Assays

TRICH function is assessed by expressing the sequences encoding TRICH at
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a
mammalian expression vector containing a strong promoter that drives high levels of cDNA
expression. Vectors of choice include PCMV SPORT (Life Technologies) and PCR3.1 (Invitrogen,
Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 µg of recombinant vector
are transiently transfected into a human cell line, for example, an endothelial or hematopoietic cell
line, using either liposome formulations or electroporation. 1-2 µg of an additional plasmid
containing sequences encoding a marker protein are co-transfected. Expression of a marker protein
provides a means to distinguish transfected cells from nontransfected cells and is a reliable predictor
of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., Green
Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry
(FCM), an automated, laser optics-based technique, is used to identify transfected cells expressing
GFP or CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties.
FCM detects and quantifies the uptake of fluorescent molecules that diagnose events preceding or
coincident with cell death. These events include changes in nuclear DNA content as measured by
staining of DNA with propidium iodide; changes in cell size and granularity as measured by forward
light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by
decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular
proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane
composition as measured by the binding of fluorescein-conjugated Annexin V protein to the cell
surface. Methods in flow cytometry are discussed in Ormerod, M.G. (1994) Flow Cytometry,
Oxford, New York NY.

The influence of TRICH on gene expression can be assessed using highly purified
populations of cells transfected with sequences encoding TRICH and either CD64 or CD64-GFP.
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved
regions of human immunoglobulin G (IgG). Transfected cells are efficiently separated from
nontransfected cells using magnetic beads coated with either human IgG or antibody against CD64
(DYNAL, Lake Success NY). mRNA can be purified from the cells using methods well known by
those of skill in the art. Expression of mRNA encoding TRICH and other genes of interest can be
analyzed by northern analysis or microarray techniques.

XIV. Production of TRICH Specific Antibodies 

TRICH substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g.,
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used
to immunize rabbits and to produce antibodies using standard protocols.

Alternatively, the TRICH amino acid sequence is analyzed using LASERGENE software
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are
well described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.)

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma-Aldrich,
St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS)
to increase immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the
oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for
antipeptide and anti-TRICH activity by, for example, binding the peptide or TRICH to a substrate,
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated
goat anti-rabbit IgG.

XV. Purification of Naturally Occurring TRICH Using Specific Antibodies

Naturally occurring or recombinant TRICH is substantially purified by immunoaffinity
chromatography using antibodies specific for TRICH. An immunoaffinity column is constructed by
covalently coupling anti-TRICH antibody to an activated chromatographic resin, such as
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is
blocked and washed according to the manufacturer's instructions.

Media containing TRICH are passed over the immunoaffinity column, and the column is
washed under conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength
buffers in the presence of detergent). The column is eluted under conditions that disrupt
antibody/TRICH binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope,
such as urea or thiocyanate ion), and TRICH is collected.
XVI. Identification of Molecules Which Interact with TRICH

Molecules which interact with TRICH may include transporter substrates, agonists or
antagonists, modulatory proteins such as Gβγ proteins (Reimann, supra) or proteins involved in
TRICH localization or clustering such as MAGUKs (Craven, supra). TRICH, or biologically active
fragments thereof, are labeled with ¹²⁵I Bolton-Hunter reagent. (See, e.g., Bolton, A.E. and W.M.
Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules previously arrayed in the wells of a
multi-well plate are incubated with the labeled TRICH, washed, and any wells with labeled TRICH
complex are assayed. Data obtained using different concentrations of TRICH are used to calculate
values for the number, affinity, and association of TRICH with the candidate molecules.
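
The text does not say how the number and affinity of binding sites are calculated from the concentration series. One conventional approach, shown here purely as an assumed illustration, is a Scatchard analysis: bound/free is plotted against bound, the slope of the fitted line is -1/Kd, and the x-intercept is the number of binding sites (Bmax). The function name and the synthetic data are hypothetical.

```python
def scatchard_fit(free, bound):
    """Least-squares Scatchard fit for a single class of binding sites.

    free, bound: concentrations of free and bound labeled ligand (same units).
    Returns (kd, bmax) from the line  bound/free = bmax/kd - bound/kd.
    """
    if len(free) != len(bound) or len(free) < 2:
        raise ValueError("need at least two paired measurements")
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0.0:
        raise ValueError("bound values must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = -intercept / slope  # x-intercept of the fitted line
    return kd, bmax


if __name__ == "__main__":
    # Synthetic, noise-free data generated with Kd = 2 and Bmax = 10.
    free = [0.5, 1.0, 2.0, 4.0, 8.0]
    bound = [10.0 * f / (2.0 + f) for f in free]
    print(scatchard_fit(free, bound))
```

With real, noisy data a direct nonlinear fit of the saturation curve is often preferred, because the Scatchard transform distorts measurement error.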

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system
(Fields, S. and O. Song (1989) Nature 340:245-246). TRICH, or fragments thereof, are expressed as
fusion proteins with the DNA binding domain of Gal4 or lexA, and potential interacting proteins are
expressed as fusion proteins with an activation domain. Interactions between the TRICH fusion
protein and the TRICH interacting proteins (fusion proteins with an activation domain) reconstitute
a transactivation function that is observed by expression of a reporter gene. Yeast 2-hybrid systems
are commercially available, and methods for use of the yeast 2-hybrid system with ion channel
proteins are discussed in Niethammer, M. and M. Sheng (1998, Meth. Enzymol. 293:104-122).

TRICH may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT)
which employs the yeast two-hybrid system in a high-throughput manner to determine all
interactions between the proteins encoded by two large libraries of genes (Nandabalan, K. et al.
(2000) U.S. Patent No. 6,057,101).

Potential TRICH agonists or antagonists may be tested for activation or inhibition of
TRICH ion channel activity using the assays described in section XVIII.
XVII. Demonstration of TRICH Activity

Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion
conductance. TRICH can be expressed by transforming a mammalian cell line such as COS7, HeLa
or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic expression vectors are
commercially available, and the techniques to introduce them into cells are well known to those
skilled in the art. A second plasmid which expresses any one of a number of marker genes, such as
β-galactosidase, is co-transformed into the cells to allow rapid identification of those cells which
have taken up and expressed the foreign DNA. The cells are incubated for 48-72 hours after
transformation under conditions appropriate for the cell line to allow expression and accumulation
of TRICH and β-galactosidase.

Transformed cells expressing β-galactosidase are stained blue when a suitable colorimetric
substrate is added to the culture media under conditions that are well known in the art. Stained cells
are tested for differences in membrane conductance by electrophysiological techniques that are well
known in the art. Untransformed cells, and/or cells transformed with either vector sequences alone
or β-galactosidase sequences alone, are used as controls and tested in parallel. Cells expressing
TRICH will have higher anion or cation conductance relative to control cells. The contribution of
TRICH to conductance can be confirmed by incubating the cells using antibodies specific for
TRICH. The antibodies will bind to the extracellular side of TRICH, thereby blocking the pore in
the ion channel, and the associated conductance.

Alternatively, ion channel activity of TRICH is measured as current flow across a TRICH-containing
Xenopus laevis oocyte membrane using the two-electrode voltage-clamp technique (Ishii
et al., supra; Jegla, T. and L. Salkoff (1997) J. Neurosci. 17:32-44). TRICH is subcloned into an
appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is injected into
mature stage IV oocytes. Injected oocytes are incubated at 18°C for 1-5 days. Inside-out
macropatches are excised into an intracellular solution containing 116 mM K-gluconate, 4 mM KCl,
and 10 mM Hepes (pH 7.2). The intracellular solution is supplemented with varying concentrations
of the TRICH mediator, such as cAMP, cGMP, or Ca²⁺ (in the form of CaCl₂), where appropriate.
Electrode resistance is set at 2-5 MΩ and electrodes are filled with the intracellular solution lacking
mediator. Experiments are performed at room temperature from a holding potential of 0 mV.
Voltage ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz.
Current measured is proportional to the activity of TRICH in the assay. In particular, the activity of
TRICH-1 is measured as Na⁺ conductance and the activity of TRICH-3 is measured as Ca²⁺
conductance.
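
Because the measured current is taken as proportional to channel activity, a ramp recording is commonly summarized by its slope conductance. The sketch below is an assumed illustration of that reduction, not part of the described protocol: it builds an evenly spaced ramp from -100 to 100 mV and fits a straight line to current versus voltage. The function names are hypothetical, and the number of samples is left as a parameter rather than derived from the recording settings.

```python
def ramp_voltages_mv(n_samples: int, v_start: float = -100.0, v_end: float = 100.0):
    """Evenly spaced command voltages for a linear ramp."""
    if n_samples < 2:
        raise ValueError("a ramp needs at least two samples")
    step = (v_end - v_start) / (n_samples - 1)
    return [v_start + i * step for i in range(n_samples)]


def slope_conductance_us(voltages_mv, currents_na):
    """Least-squares slope of current against voltage.

    With current in nanoamperes and voltage in millivolts the slope is in
    microsiemens (1 nA / 1 mV = 1 microsiemens).
    """
    n = len(voltages_mv)
    if n != len(currents_na) or n < 2:
        raise ValueError("need at least two paired samples")
    mean_v = sum(voltages_mv) / n
    mean_i = sum(currents_na) / n
    svv = sum((v - mean_v) ** 2 for v in voltages_mv)
    svi = sum((v - mean_v) * (i - mean_i)
              for v, i in zip(voltages_mv, currents_na))
    return svi / svv


if __name__ == "__main__":
    volts = ramp_voltages_mv(5)
    amps = [0.02 * v for v in volts]  # synthetic ohmic response
    print(volts, slope_conductance_us(volts, amps))
```

A single straight-line fit is only appropriate for an ohmic current-voltage relation; rectifying channels would need the slope taken over a restricted voltage range.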

Transport activity of TRICH is assayed by measuring uptake of labeled substrates into
Xenopus laevis oocytes. Oocytes at stages V and VI are injected with TRICH mRNA (10 ng per
oocyte) and incubated for 3 days at 18°C in OR2 medium (82.5 mM NaCl, 2.5 mM KCl, 1 mM CaCl₂,
1 mM MgCl₂, 1 mM Na₂HPO₄, 5 mM Hepes, 3.8 mM NaOH, 50 µg/ml gentamycin, pH 7.8) to
allow expression of TRICH. Oocytes are then transferred to standard uptake medium (100 mM
NaCl, 2 mM KCl, 1 mM CaCl₂, 1 mM MgCl₂, 10 mM Hepes/Tris pH 7.5). Uptake of various
substrates (e.g., amino acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding
labeled substrate (e.g., radiolabeled with ³H, fluorescently labeled with rhodamine, etc.) to the
oocytes. After incubating for 30 minutes, uptake is terminated by washing the oocytes three times
in Na⁺-free medium, measuring the incorporated label, and comparing with controls. TRICH
activity is proportional to the level of internalized labeled substrate. Test substrates include ran-GTP
for TRICH-2, glucose for TRICH-5, amino acids for TRICH-6 and TRICH-14, cations for
TRICH-7 and TRICH-16, Na⁺, K⁺ and Cl⁻ ions for TRICH-15, reduced folate or analogues such as
methotrexate for TRICH-17, divalent cations for TRICH-18, anions such as arsenate and antimonite
for TRICH-19, and nitrate or oligopeptides for

ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled
ATP-[γ-³²P], separation of the hydrolysis products by chromatographic methods, and quantitation of
the recovered ³²P using a scintillation counter. The reaction mixture contains ATP-[γ-³²P] and
varying amounts of TRICH in a suitable buffer incubated at 37°C for a suitable period of time. The
reaction is terminated by acid precipitation with trichloroacetic acid and then neutralized with base,
and an aliquot of the reaction mixture is subjected to membrane or filter paper-based
chromatography to separate the reaction products. The amount of ³²P liberated is counted in a
scintillation counter. The amount of radioactivity recovered is proportional to the ATPase activity
of TRICH in the assay.

Lipocalin activity of TRICH is measured by ligand fluorescence enhancement
spectrofluorometry (Lin et al. (1997) Molecular Vision 3:17). Examples of ligands include retinol
(Sigma, St. Louis MO) and 16-anthryloxy-palmitic acid (16-AP) (Molecular Probes Inc., Eugene
OR). Ligand is dissolved in 100% ethanol and its concentration is estimated using known
extinction coefficients (retinol: 46,000 A/M/cm at 325 nm; 16-AP: 8,200 A/M/cm at 361 nm). A

700 µl aliquot of 1 µM TRICH in 10 mM Tris (pH 7.5), 2 mM EDTA, and 500 mM NaCl is placed
in a 1 cm path length quartz cuvette and 1 µl aliquots of ligand solution are added. Fluorescence is
measured 100 seconds after each addition until readings are stable. Change in fluorescence per unit
change in ligand concentration is proportional to TRICH activity.
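
Estimating the ligand concentration from the extinction coefficients quoted above is an application of the Beer-Lambert law, c = A / (ε · l). The sketch below assumes the two coefficients as given in the text and a 1 cm path length; the function name and the example absorbance readings are hypothetical.

```python
# Extinction coefficients quoted in the text, in absorbance units per molar per cm.
EXTINCTION_PER_M_PER_CM = {
    "retinol": 46000.0,  # read at 325 nm
    "16-AP": 8200.0,     # read at 361 nm
}


def ligand_concentration_molar(absorbance: float, ligand: str,
                               path_cm: float = 1.0) -> float:
    """Beer-Lambert estimate: concentration = A / (epsilon * path length)."""
    epsilon = EXTINCTION_PER_M_PER_CM[ligand]
    return absorbance / (epsilon * path_cm)


if __name__ == "__main__":
    print(ligand_concentration_molar(0.46, "retinol"))
    print(ligand_concentration_molar(0.82, "16-AP"))
```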
XVIII. Identification of TRICH Agonists and Antagonists

TRICH is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or
HEK (Human Embryonic Kidney) 293. Ion channel activity of the transformed cells is measured in
the presence and absence of candidate agonists or antagonists. Ion channel activity is assayed using
patch clamp methods well known in the art or as described in Example XVII. Alternatively, ion
channel activity is assayed using fluorescent techniques that measure ion flux across the cell
membrane (Velicelebi, G. et al. (1999) Meth. Enzymol. 294:20-47; West, M.R. and C.R. Molloy
(1996) Anal. Biochem. 241:51-58). These assays may be adapted for high-throughput screening
using microplates. Changes in internal ion concentration are measured using fluorescent dyes such
as the Ca²⁺ indicator Fluo-4 AM, sodium-sensitive dyes such as SBFI and sodium green, or the Cl⁻
indicator MQAE (all available from Molecular Probes) in combination with the FLIPR fluorimetric
plate reading system (Molecular Devices). In a more generic version of this assay, changes in
membrane potential caused by ionic flux across the plasma membrane are measured using oxonol
dyes such as DiBAC₄ (Molecular Probes). DiBAC₄ equilibrates between the extracellular solution
and cellular sites according to the cellular membrane potential. The dye's fluorescence intensity is
20-fold greater when bound to hydrophobic intracellular sites, allowing detection of DiBAC₄ entry
into the cell (Gonzalez, J.E. and P.A. Negulescu (1998) Curr. Opin. Biotechnol. 9:624-631).
Candidate agonists or antagonists may be selected from known ion channel agonists or antagonists,
peptide libraries, or combinatorial chemical libraries.

Various modifications and variations of the described methods and systems of the invention
will be apparent to those skilled in the art without departing from the scope and spirit of the
invention. Although the invention has been described in connection with certain embodiments, it
should be understood that the invention as claimed should not be unduly limited to such specific
embodiments. Indeed, various modifications of the described modes for carrying out the invention
which are obvious to those skilled in molecular biology or related fields are intended to be within
the scope of the following claims.
Table 5

Polynucleotide SEQ ID NO:    Incyte Project ID:    Representative Library
33                           7484831CB1            LIVRNON08
34                           2477266CB1            LIVRNON08
35                           3552033CB1            BRAHTDK01
36                           4778139CB1            PROSTUS19
37                           4787433CB1            PGANNOIDI
38                           7483598CB1            BRAITUT29
39                           7484823CB1            TESTNOC01
40                           143935CB1             BRACDIK08
41                           5923789CB1            BRAIFEr02
42                           6046484CB1            BRACNOK02
44                           7483595CB1            TESTNOC01
45                           3788427CB1            BONEUNR01
46                           6972455CB1            BMARUNR02
47                           8077668CB1            ADRETUE02
48                           55120485CB1           BRAITUT29
49                           3112883CB1            BRSTNOT05
50                           4253888CB1            ADRETUE02
52                           7483850CB1            lJVRDm)6
53                           5508353CB1            NBRDTDNOS
54                           8543628CB1            BMARUNR02
55                           7482754CB1            PTHYTMN05
56                           3794818CB1            KIDEUNE02
57                           4717525CB1            KIDEUNE02
58                           5091793CB1            LUNGTUT08
59                           5945527CB1            SINTNOR01
60                           694n24CB1             FTUBTUR01
61                           6972530CB1            BMARUNR02
62                           6991750CB1            BRAIFER06
63                           71726948CB1           KIDNNOT32

What is claimed is:

1. An isolated polypeptide selected from the group consisting of:

a) a polypeptide comprising an amino acid sequence selected from the group
consisting of SEQ ID NO:1-32,

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90%
identical to an amino acid sequence selected from the group consisting of SEQ ID
NO:1-32,

c) a biologically active fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-32, and

d) an immunogenic fragment of a polypeptide having an amino acid sequence selected
from the group consisting of SEQ ID NO:1-32.

2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the
group consisting of SEQ ID NO:1-32.

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:33-64. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method of producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein 
said cell is transformed with a recombinant polynucleotide, and said recombinant 
polynucleotide comprises a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide of claim 1, and 

b) recovering the polypeptide so expressed. 

10. A method of claim 9, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-32. 

11. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:33-64, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:33-64, 

c) a polynucleotide complementary to a polynucleotide of a), 

d) a polynucleotide complementary to a polynucleotide of b), and 

e) an RNA equivalent of a)-d). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

14. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, 
and which probe specifically hybridizes to said target polynucleotide, under 
conditions whereby a hybridization complex is formed between said probe and said 
target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 

16. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
reaction amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or 
fragment thereof, and, optionally, if present, the amount thereof. 

17. A composition comprising a polypeptide of claim 1 and a pharmaceutically acceptable 
excipient. 

18. A composition of claim 17, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-32. 

19. A method for treating a disease or condition associated with decreased expression of 
functional TRICH, comprising administering to a patient in need of such treatment the 
composition of claim 17. 

20. A method of screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

21. A composition comprising an agonist compound identified by a method of claim 20 and 
a pharmaceutically acceptable excipient. 

22. A method for treating a disease or condition associated with decreased expression of 
functional TRICH, comprising administering to a patient in need of such treatment a 
composition of claim 21. 

23. A method of screening a compound for effectiveness as an antagonist of a polypeptide 
of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

24. A composition comprising an antagonist compound identified by a method of claim 23 
and a pharmaceutically acceptable excipient. 

25. A method for treating a disease or condition associated with overexpression of 
functional TRICH, comprising administering to a patient in need of such treatment a 
composition of claim 24. 

26. A method of screening for a compound that specifically binds to the polypeptide of 
claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under 
suitable conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby 
identifying a compound that specifically binds to the polypeptide of claim 1. 

27. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under 
conditions permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test 
compound, and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test 
compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 
presence of the test compound is indicative of a compound that modulates the 
activity of the polypeptide of claim 1. 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the 
method comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under 
conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying 
amounts of the compound and in the absence of the compound. 

29. A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe 
comprising at least 20 contiguous nucleotides of a polynucleotide of claim 12 under 
conditions whereby a specific hybridization complex is formed between said probe 
and a target polynucleotide in the biological sample, said target polynucleotide 
comprising a polynucleotide sequence of a polynucleotide of claim 12 or fragment 
thereof, 

c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample 
with the amount of hybridization complex in an untreated biological sample, 
wherein a difference in the amount of hybridization complex in the treated 
biological sample is indicative of toxicity of the test compound. 

30. A diagnostic test for a condition or disease associated with the expression of TRICH in 
a biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 11, under conditions 
suitable for the antibody to bind the polypeptide and form an antibody:polypeptide 
complex, and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 

31. The antibody of claim 11, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab')2 fragment, or 

e) a humanized antibody. 

32. A composition comprising an antibody of claim 11 and an acceptable excipient. 

33. A method of diagnosing a condition or disease associated with the expression of 
TRICH in a subject, comprising administering to said subject an effective amount of the 
composition of claim 32. 

34. A composition of claim 32, wherein the antibody is labeled. 

35. A method of diagnosing a condition or disease associated with the expression of 
TRICH in a subject, comprising administering to said subject an effective amount of the 
composition of claim 34. 

36. A method of preparing a polyclonal antibody with the specificity of the antibody of 
claim 11, the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-32, or an immunogenic 
fragment thereof, under conditions to elicit an antibody response, 

b) isolating antibodies from said animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a 
polyclonal antibody which binds specifically to a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO:1-32. 

37. A polyclonal antibody produced by a method of claim 36. 

38. A composition comprising the polyclonal antibody of claim 37 and a suitable carrier. 

39. A method of making a monoclonal antibody with the specificity of the antibody of 
claim 11, the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-32, or an immunogenic 
fragment thereof, under conditions to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
antibody-producing hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating from the culture monoclonal antibody which binds specifically to a 
polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO:1-32. 

40. A monoclonal antibody produced by a method of claim 39. 

41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 

42. The antibody of claim 11, wherein the antibody is produced by screening a Fab 
expression library. 

43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

44. A method of detecting a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO:1-32 in a sample, the method comprising: 

a) incubating the antibody of claim 11 with a sample under conditions to allow 
specific binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO:1-32 in the sample. 



45. A method of purifying a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO:1-32 from a sample, the method comprising: 

a) incubating the antibody of claim 11 with a sample under conditions to allow 
specific binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID 
NO:1-32. 

46. A microarray wherein at least one element of the microarray is a polynucleotide of 
claim 13. 

47. A method of generating an expression profile of a sample which contains 
polynucleotides, the method comprising: 

a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microarray of claim 46 with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a 
hybridization complex, and 

c) quantifying the expression of the polynucleotides in the sample. 

48. An array comprising different nucleotide molecules affixed in distinct physical 
locations on a solid substrate, wherein at least one of said nucleotide molecules comprises a 
first oligonucleotide or polynucleotide sequence specifically hybridizable with at least 30 
contiguous nucleotides of a target polynucleotide, and wherein said target polynucleotide is 
a polynucleotide of claim 12. 

49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target 
polynucleotide. 

50. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 60 contiguous nucleotides of said target 
polynucleotide. 

51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to said target polynucleotide. 

52. An array of claim 48, which is a microarray. 

53. An array of claim 48, further comprising said target polynucleotide hybridized to a 
nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 

55. An array of claim 48, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct 
physical location have the same sequence, and each distinct physical location on the 
substrate contains nucleotide molecules having a sequence which differs from the sequence 
of nucleotide molecules at another distinct physical location on the substrate. 

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:1. 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:8. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:10. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:11. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:12. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:13. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:14. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:15. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:16. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:17. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:18. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:19. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

78. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

79. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 

80. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

81. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

82. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27. 

83. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

84. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 

85. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30. 

86. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:31. 

87. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:32. 

88. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:33. 

89. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:34. 

90. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:35. 

91. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:36. 

92. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:37. 

93. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:38. 

94. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:39. 

95. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:40. 

96. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:41. 

97. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:42. 

98. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:43. 

99. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:44. 

100. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:45. 

101. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:46. 

102. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:47. 

103. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:48. 

104. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:49. 

105. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:50. 

WO 02/46415 PCT/US01/46963 

106. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:51. 

107. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:52. 

108. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:53. 

109. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:54. 

110. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:55. 

111. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:56. 

112. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:57. 

113. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:58. 

114. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:59. 

115. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:60. 

116. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:61. 

117. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:62. 

118. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:63. 

119. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:64. 



<110> INCYTE GENOMICS, INC. 
LEE, Ernestine A. 
BAUGHN, Mariah R. 
YUE, Henry 
DING, Li 
RAUMANN, Brigette E. 
HAFALIA, April J. A. 
KHAN, Farrah A. 
NGUYEN, Danniel B. 
ELLIOTT, Vicki S. 
RAMKUMAR, Jayalaxmi 
WALIA, Narinder K. 
ISON, Craig H. 
LU, Yan 
GANDHI, Ameena E. 
WARREN, Bridget A. 
DUGGAN, Brendan M. 
TRIBOULEY, Catherine M. 
BURFORD, Neil 
LU, Dyung Aina M. 
LAL, Preeti 
YAO, Monique G. 
XU, Yuming 
BRUNS, Christopher 
THANGAVELU, Kavitha 
SWARNAKAR, Anita 
TANG, Y. Tom 
AZIMZAI, Yalda 
THORNTON, Michael 
ARVIZU, Chandra 
POLICKY, Jennifer 

<120> TRANSPORTERS AND ION CHANNELS 

<130> PI-0314 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/254,303; 60/256,190; 60/257,504; 60/261,546; 60/262,832; 
60/264,377; 60/266,019 
<151> 2000-12-08; 2000-12-15; 2000-12-21; 2001-01-12; 2001-01-19; 
2001-01-26; 2000-02-02 

<160> 64 

<170> PERL Program 

<210> 1 
<211> 726 
<212> PRT 
<213> Homo sapiens 

<220> 
<221> misc_feature 
<223> Incyte ID No: 7484831CD1 

<400> 1 

Met Glu Pro Gly Asp Ala Ala Arg Pro Gly Ser Gly Arg Ala Thr 
1 5 10 15 

Gly Ala Pro Pro Pro Arg Leu Leu Leu Leu Pro Leu Leu Leu Gly 
20 25 30 

Trp Gly Leu Arg Val Ala Ala Ala Ala Ser Ala Ser Ser Ser Gly 
35 40 45 

Ala Ala Ala Glu Asp Ser Ser Ala Met Glu Glu Leu Ala Thr Glu 
50 55 60 

Lys Glu Ala Glu Glu Ser His Arg Gln Asp Ser Val Ser Leu Leu 
65 70 75 

Thr Phe Ile Leu Leu Leu Thr Leu Thr Ile Leu Thr Ile Trp Leu 
80 85 90 

Phe Lys His Arg Arg Val Arg Phe Leu His Glu Thr Gly Leu Ala 
95 100 105 

Met Ile Tyr Gly Leu Ile Val Gly Val Ile Leu Arg Tyr Gly Thr 
110 115 120 

Pro Ala Thr Ser Gly Arg Asp Lys Ser Leu Ser Cys Thr Gln Glu 
125 130 135 

Asp Arg Ala Phe Ser Thr Leu Leu Val Asn Val Ser Gly Lys Phe 
140 145 150 

Phe Glu Tyr Thr Leu Lys Gly Glu Ile Ser Pro Gly Lys Ile Asn 
155 160 165 

Ser Val Glu Gln Asn Asp Met Leu Arg Lys Val Thr Phe Asp Pro 
170 175 180 

Glu Val Phe Phe Asn Ile Leu Leu Pro Pro Ile Ile Phe His Ala 
185 190 195 

Gly Tyr Ser Leu Lys Lys Arg His Phe Phe Arg Asn Leu Gly Ser 
200 205 210 

Ile Leu Ala Tyr Ala Phe Leu Gly Thr Ala Val Ser Cys Phe Ile 
215 220 225 

Ile Gly Asn Leu Met Tyr Gly Val Val Lys Leu Met Lys Ile Met 
230 235 240 

Gly Gln Leu Ser Asp Lys Phe Tyr Tyr Thr Asp Cys Leu Phe ^e 
245 250 255 

Gly Ala Ile Ile Ser Ala Thr Asp Pro Val Thr Val Leu Ala Ile 
260 265 270 

Phe Asn Glu Leu His Ala Asp Val Asp Leu Tyr Ala Leu Leu Phe 
275 280 285 

Gly Glu Ser Val Leu Asn Asp Ala Val Ala Ile Val Leu Ser Ser 
290 295 300 

Ser Ile Val Ala Tyr Gln Pro Ala Gly Leu Asn Thr His Ala Phe 
305 310 315 

Asp Ala Ala Ala Phe Phe Lys Ser Val Gly Ile Phe Leu Gly Ile 
320 325 330 

Phe Ser Gly Ser Phe Thr Met Gly Ala Val Thr Gly Val Val Thr 
335 340 345 

Ala Leu Val Thr Lys Phe Thr Lys Leu His Cys Phe Pro Leu Leu 
350 355 360 

Glu Thr Ala Leu Phe Phe Leu Met Ser Trp Ser Thr Phe Leu Leu 
365 370 375 

Ala Glu Ala Cys Gly Phe Thr Gly Val Val Ala Val Leu Phe Cys 
380 385 390 

Gly Ile Thr Gln Ala His Tyr Thr Tyr Asn Asn Leu Ser Val Glu 
395 400 405 

Ser Arg Ser Arg Thr Lys Gln Leu Phe Glu Val Leu His Phe Leu 
410 415 420 

Ala Glu Asn Phe Ile Phe Ser Tyr Met Gly Leu Ala Leu Phe Thr 
425 430 435 

Phe Gln Lys His Val Phe Ser Pro Ile Phe Ile Ile Gly Ala Phe 
440 445 450 

Val Ala Ile Phe Leu Gly Arg Ala Ala His Ile Tyr Pro Leu Ser 
455 460 465 

Phe Phe Leu Asn Leu Gly Arg Arg His Lys Ile Gly Trp Asn Phe 
470 475 480 

Gln His Met Met Met Phe Ser Gly Leu Arg Gly Ala Met Ala Phe 
485 490 495 

Ala Leu Ala Ile Arg Asp Thr Ala Ser Tyr Ala Arg Gln Met Met 
500 505 510 

Phe Thr Thr Thr Leu Leu Ile Val Phe Phe Thr Val Trp Ile Ile 
515 520 525 

Gly Gly Gly Thr Thr Pro Met Leu Ser Trp Leu Asn Ile Arg Val 
530 535 540 

Gly Val Glu Glu Pro Ser Glu Glu Asp Gln Asn Glu His His Trp 
545 550 555 

Gln Tyr Phe Arg Val Gly Val Asp Pro Asp Gln Asp Pro Pro Pro 
560 565 570 

Asn Asn Asp Ser Phe Gln Val Leu Gln Gly Asp Gly Pro Asp Ser 
575 580 585 

Ala Arg Gly Asn Arg Thr Lys Gln Glu Ser Thr Trp Ile Phe Arg 
590 595 600 

Leu Trp Tyr Ser Phe Asp His Asn Tyr Leu Lys Pro Ile Leu Thr 
605 610 615 

His Ser Gly Pro Pro Leu Thr Thr Thr Leu Pro Ala Trp Cys Gly 
620 625 630 

Leu Leu Ala Arg Cys Leu Thr Ser Pro Gln Val Tyr Asp Asn Gln 
635 640 645 

Glu Pro Leu Arg Glu Glu Asp Ser Asp Phe Ile Leu Thr Glu Gly 
650 655 660 

Asp Leu Thr Leu Thr Tyr Gly Asp Ser Thr Val Thr Ala Asn Gly 
665 670 675 

Ser Ser Ser Ser His Thr Ala Ser Thr Ser Leu Glu Gly Ser Arg 
680 685 690 

Arg Thr Lys Ser Ser Ser Glu Glu Val Leu Glu Arg Asp Leu Gly 
695 700 705 

Met Gly Asp Gln Lys Val Ser Ser Arg Gly Thr Arg Leu Val Phe 
710 715 720 

Pro Leu Glu Asp Asn Ala 
725 



<210> 2 

<211> 1081 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2477266CD1 



<400> 2 

Met Glu Ser Ala Gly Leu Glu Gin Leu Leu Arg Glu Leu Leu Leu 
15 10 15 

Pro Asp Thr Glu Arg He Arg Arg Ala Thr Glu Gin Leu Gin He 
20 ' 25 30 

Val Leu Arg Ala Pro Ala Ala Leu Pro Ala Leu Cys Asp Leu Leu 
35 40 45 

Ala Ser Ala Ala Asp Pro Gin He Arg Gin Phe Ala Ala Val Leu 
50 55 60 

Thr Arg Arg Arg Leu Asn Thr Arg Trp Arg Arg Leu Ala Ala Glu 
65 70 75 

Gin Arg Glu Ser Leu Lys Ser Leu He Leu Thr Ala Leu Gin Arg 
80 85 90 

Glu Thr Glu His Cys Val Ser Leu Ser Leu Ala Gin Leu Ser Ala 
95 100 105 

Thr He Phe Arg Lys Glu Gly Leu Glu Ala Trp Pro Gin Leu Leu 
110 115 120 

Gin Leu Leu Gin His Ser Thr His Ser Pro His Ser Pro Glu Arg 
125 130 135 

Glu Met Gly Leu Leu Leu Leu Ser Val Val Val Thr Ser Arg Pro 
140 145 150 

Glu Ala Phe Gin Pro His His Arg Glu Leu Leu Arg Leu Leu Asn 
155 160 165 
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Glu Thr Leu Gly Glu VaX Gly Ser Pro Gly Leu Leu Phe Tyr Ser 

170 175 180 

Leu Arg Thr Leu Thr Thr Met Ala Pro Tyr Leu Ser Thr Glu Asp 

185 190 195 

Val Pro Leu Ala Arg Met Leu Val Pro Lys Leu lie Met Ala Met 

200 205 210 

Gin Thr Leu He Pro He Asp Glu Ala Lys Ala Cys Glu Ala Leu 

215 220 225 

Glu Ala Leu Asp Glu Leu Leu Glu Ser Glu Val Pro Val He Thr 

230 235 240 

Pro Tyr Leu Ser Glu Val Leu Thr Phe Cys Leu Glu Val Ala Arg 

245 250 255 

Asn Val Ala Leu Gly Asn Ala He Arg He Arg He Leu Cys Cys 

260 265 270 

Leu Thr Phe Leu Val Lys Val Lys Ser Lys Ala Leu Leu Lys Asn 

275 280 285 

Arg Leu Leu Pro Pro Leu Leu His Thr Leu Phe Pro He Val Ala 

290 295 300 

Ala Glu Pro Pro Pro Gly Gin Leu Asp Pro Glu Asp Gin Asp Ser 

305 310 315 

Glu Glu Glu Glu Leu Glu He Glu Leu Met Gly Glu Thr Pro Lys 

320 325 330 

His Phe Ala Val Gin Val Val Asp Met Leu Ala Leu His Leu Pro 

335 340 345 

Pro Glu Lys Leu Cys Pro Gin Leu Met Pro Met Leu Glu Glu Ala 

350 355 360 

Leu Arg Ser Glu Ser Pro Tyr Gin Arg Lys Ala Gly Leu Leu Val 

365 370 375 

Leu Ala Val Leu Ser Asp Gly Ala Gly Asp His He Arg Gin Arg 

380 385 390 

Leu Leu Pro Pro Leu Leu Gin He Val Cys Lys Gly Leu Glu Asp 

395 40O 405 

Pro Ser Gin Val Val Arg Asn Ala Ala Leu Phe Ala Leu Gly Gin 

410 415 420 

Phe Ser Glu Asn Leu Gin Pro His He Ser Ser Tyr Ser Arg Glu 

425 430 435 

Val Met Pro Leu Leu Leu Ala Tyr Leu Lys Ser Val Pro Leu Gly 

440 445 450 

His Thr His His Leu Ala Lys Ala Cys Tyr Ala Leu Glu Asn Phe 

455 460 465 

Val Glu Asn Leu Gly Pro Lys Val Gin Pro Tyr Leu Pro Glu Leu 

470 475 480 

Met Glu Cys Met Leu Gin Leu Leu Arg Asn Pro Ser Ser Pro Arg 

485 490 495 

Ala Lys Glu Leu Ala Val Ser Ala Leu Gly Ala He Ala Thr Ala 

500 505 510 

Ala Gin Ala Ser Leu Leu Pro Tyr Phe Pro Ala He Met Glu His 

515 520 525 

Leu Arg Glu Phe Leu Leu Thr Gly Arg Glu Asp Leu Gin Pro Val 

530 535 540 

Gin He Gin Ser Leu Glu Thr Leu Gly Val Leu Ala Arg Ala Val 

545 550 555 

Gly Glu Pro Met Arg Pro Leu Ala Glu Glu Cys Cys Gin Leu Gly 

560 565 570 

Leu Gly Leu Cys Asp Gin Val Asp Asp Ala Asp Leu Arg Arg Cys 

575 580 585 

Thr Tyr Ser Leu Phe Ala Ala Leu Ser Gly Leu Met Gly Glu Gly 

590 595 600 

Leu Ala Pro His Leu Glu Gin He Thr Thr Leu Met Leu Leu Ser 

605 610 615 

Leu Arg Ser Thr Glu Gly He Val Pro Gin Tyr Asp Gly Ser Ser 

620 625 630 

Ser Phe Leu Leu Phe Asp Asp Glu Ser Asp Gly Glu Glu Glu Glu 
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635 










640 








645 


Glu 


Leu 


Met 


Asp 


Glu 
650 


Asp 


Val 


Glu 


Glu 


Glu 

655 


Asp 


Asp 


Ser 


Glu He 
660 


Ser 


Gly 


Tyr 


Ser 


Val 
665 


Glu 


Asn 


Ala 


Phe 


Phe 
670 


Asp 


Glu 


Lys 


Glu Asp 
675 


Thr 


Cys 


Ala 


Ala 


Val 


Gly 


Glu 


He 


Ser 


Val 


Asn 


Thr 


Ser 


Val Ala 








680 










685 








690 


Phe 


Leu 


Pro 


Tyr 


Met 


Glu 


Ser 


val 


Phe 


Glu 


Glu 


Val 


Phe 


Lys Leu 








695 










700 








705 


Leu 


Glu 


Cvs 


Pro 


His 


Leu 


Asn 


Val 


Arg 


Lys 


Ala 


Ala 


His 


Glu Ala 








710 










715 








720 


Leu 


Glv 


Gin 


Phe 


Cys 


Cys 


Ala 


Leu 


His 


Lys 


Ala 


Cys 


Gin 


Ser Cys 








725 










730 








735 


Pro 


Ser 


Glu 


Pro 


Asn 
740 


Thr 


Ala 


Ala 


Leu 


Gin 
745 


Ala 


Ala 


Leu 


Ala Arg 

750 


Val 


Val 


Pro 


Ser 


Tvr 
755 


Met 


Gin 


Ala 


Val 


Asn 
760 


Arg 


Glu 


Arg 


Glu Arg 
765 


nln 


Val 


Val 


Met 


Ala 
770 


Val 


Leu 


Glu 


Ala 


Leu 
775 


Thr 


Gly 


Val 


Leu Arg 
780 


SeiT 


Cys 


Glv 


Thr 


Leu 


Thr 


Leu 


Lys 


Pro 


Pro 


Gly 


Atq 


Leu 


Ala Glu 






785 










790 








795 






Glv 


Val 


Leu 


Lys 


Ala 


Val 


Leu 


Gin 


Arcr 


LVS 


Thr 


Ala Cys 






800 










805 








810 


Gin 


Asp 


Thr 


Asp 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 


ASO 


Asp 


Gin Ala 








815 










820 








825 


Glu 




Asp 


Ala 


Met 


Leu 


Leu 


Glu 


His 


Ala 


Gly 


Glu 


Ala 


He Pro 






830 










835 








840 


Ala 


Leu 


Ala 


Ala 


Ala 
845 


Ala 


Glv 


Gly 


Asp 


Ser 
850 


Phe 


Ala 


Pro 


Phe Phe 

855 


Ala 


Gly 


Phe 


Leu 


Pro 


Leu 


Leu 


Val 


Cvs 

V. jr w 


LVS 


Thr 


LvS 


Gin 


Gly Cys 








860 










865 








870 


Thr 


Val 


Ala 


Glu 


Lys 
875 


Ser 


Phe 


Ala 


Val 


Gly 
880 


Thr 


Leu 


Ala 


Glu Thr 
885 


He 


Gin 


Gly


Leu 


Gly


Ala 


Ala 


Ser 


Ala 


Gin 


Phe 


Val 


Ser 


Arg Leu 








890 










895 








900 


Leu 


Pro 


Val 


Leu 


Leu 
905 


Ser 


Thr 


Ala 


Gin 


Glu 
910 


Ala 


Asp 


Pro 


Glu Val 
915 


Arg


Ser 


Asn 


Ala 


He 
920 


Phe 


Gly


Met 


Gly 


Val 
925 


Leu


Ala 


Glu 


His Gly 
930 


Gly


His 


Pro 


Ala 


Gin 


Glu 


His 


Phe 


Pro 


Lys


Leu 


Leu 


Gly 


Leu Leu 








935 










940 








945 


Phe 


Pro 


Leu 


Leu 


Ala 

950 


Arg


Glu 


Arg


His 


Asp 
955 


Arg


Val 


Arg


Asp Asn 
960 


He 




Gly


Ala 


Leu 


Ala 


Arg


Leu 


Leu 


Met 


Ala 


Ser 


Pro 


Thr Arg 








965 










970 








975 


Lys 


Pro 


Glu 


Pro 


Gin 
980 


Val 


Leu 


Ala 


Ala 


Leu 
985 


Leu 


His 


Ala 


Leu Pro 
990 


Leu 


Lys


Glu 


Asp 


Leu 


Glu 


Glu 


Trp 


Val 


Thr 


He 


Gly 


Arg 


Leu Phe 






995 








1000 








1005 


Ser 


Phe 


Leu 


Tyr 


Gin 


Ser 


Ser 


Pro 


Asp Gin 


Val 


He 


Asp 


val Ala 








1010 








1015 








1020 


Pro 


Glu 


Leu 


Leu Arg


He 


Cys 


Ser 


Leu 


He 


Leu 


Ala 


Asp 


Asn Lys 








1025 








1030 








1035 


He 


Pro 


Pro 


Asp Thr 


Lys 


Ala 


Ala 


Leu 


Leu 


Leu 


Leu 


Leu 


Thr Phe 








1040 








1045 








1050 


Leu 


Ala 


Lys 


Gin His 


Thr 


Asp 


Ser 


Phe 


Gin 


Ala 


Ala 


Leu 


Gly Ser 






1055 








1060 








1065 


Leu 


Pro 


Val 


Asp Lys 
1070 


Ala 


Gin 


Glu 


Leu Gin 
1075 


Ala 


Val 


Leu 


Gly Leu 
1080 



Ser 
<400> 3








Met 


Tyr 


Val 


Arg 


Val 


1 








5 


His 


Leu 


Met 


Thr 


Lys 
20 


lie 


Ser 


Val 


His 


Gly 
35 


Leu 


Lys 


Gin 


Val 


Phe 
50 


Thr 


Gly 


Ala 


Trp 


He 
65 


Arg 


His 


Val 


Gly 


Asp 
80 


Gly 


Lys 


He 


Cys 


Thr 
95 


Asn 


Gin 


Glu 


Asp 


Leu 
110 


Thr 


Met 


Ser 


Asn 


Pro 
125 


His 


Ser 


His 


Phe 


He 
140 


Gly 


Ala 


Glu 


Val 


Lys 
155 


Leu 


Gin 


Lys 


He 


Asn 
170 


Ala 


Leu 


He 


Val 


Glu 
185 


Glu 


Tyr 


Leu 


Arg 


Asp 
200 


Gly 


Ser 


Gly 


Arg 


Ala 
215 


Ser 


Glu 


Glu 


Gly 


Gly 
230 


Leu 


Val 


Thr 


He 


Gin 
245 


Gin 


His 


Leu 


Phe 


He 
260 


Leu 


lie 


Thr 


Val 


Phe 

275 


Asp 


Leu 


Ala 


He 


Leu 
290 


Ala 


Pro


Asp 


Gin 


Leu 
305 


lie 


Ala 


Arg 


Ser 


Gin 

320 


Gly 


Ser 


Leu 


Glu 


Gin 
335 


Val


Asp 


Phe 


Val 


Lys 
350 


Arg 


Phe 


Leu 


Thr 


He 
365 


His 


Gly 


Pro 


Ser 


Asn 
380 


Lys


Gly 


Asn 


Leu 


Pro 
395 
Ser


Phe 


Asp 


Thr 


Lys 
10 


Glu 


Trp 


Gin 


Leu 


Glu 
25 


Gly 


Leu 


Gin 


Asn 


Phe 
40 


Gly 


Lys 


Gly 


Leu 


He 
55 


Phe 


Thr 


Gly 


Gly 


Val 
70 


Ala 


Leu 


Lys 


Asp 


His 
85 


He 


Gly 


He 


Ala 


Pro 
100 


He 


Gly 


Arg 


Asp 


val 
115 


Met 


Ser 


Lys 


Leu 


Thr 








130 


Leu 


Ala 


Asp 


Asn 


Gly 
145 


Leu 


Arg 


Arg 


Gin 


Leu 
160 


Thr 


Arg 


He 


Gly 


Gin 

175 


Gly 


Gly 


Pro 


Asn 


Val 
190 


Thr 


Pro 


Pro 


Val 


Pro 
205 


Ser 


Asp 


He 


Leu 


Ala 
220 


Leu 


He 


Asn 


Glu 


Ser 
235 


Lys 


Thr 


Phe 


Thr 


Tyr
250 


He 


Leu 


Met 


Glu 


Cys 
265 


Arg 


Met 


Gly 


Ser 


Glu 
280 


Thr 


Ala 


Leu 


Leu 


Lys 
295 


Ser 


Leu 


Ala 


Leu 


Ala 
310 


He 


Phe 


He 


Tyr 


Gly 
325 


Ala 


Met 


Leu 


Asp 


Ala 
340 


Leu 


Leu 


He 


Glu 


Asn 
355 


Ser 


Arg 


Leu 


Glu 


Glu 

370 


Thr 


Leu 


Tyr 


His 


Leu 
385 


Pro 


Asp 


Tyr 


Arg 


He 
400 



Pro 


Asp 


Leu 


Leu 


Leu 
15 


Leu 


Pro 


Lys 


Leu 


Leu 
30 


Glu 


Leu 


Gin 


Pro 


Lys 
45 


Lys 


Ala 


Ala 


Met 


Thr 
60 


Asn 


Thr 


Gly 


Val 


He 

75 


Ala 


Ser 


Lys 


Ser 


Arg 
90 


Trp 


Gly 


He 


Val 


Glu 
105 


val 


Arg 


Pro 


Tyr 


Gin 
120 


Val 


Leu 


Asn 


Ser 


Met 
135 


Thr 


Thr 


Gly 


Lys 


Tyr 
150 


Glu 


Lys 


His 


He 


Ser 
165 


Gly 


Val 


Pro 


Val 


Val 

180 


He 


Ser 


He 


Val 


Leu 
195 


Val 


val 


val 


Cys 


Asp 
210 


Phe 


Gly 


His 


Lys 


Tyr 
225 


Leu 


Arg 


Asp 


Gin 


Leu 
240 


Thr 


Arg 


Thr 


Gin 


Ala 

255 


Met 


Lys 


Lys 


Lys 


Glu 
270 


Gly 


His 


Gin 


Asp 


He 
285 


Gly 


Ala 


Asn 


Ala 


Ser 
300 


Trp 


Asn 


Arg 


Val 


Asp 
315 


Gin 


Gin 


Trp 


Pro 


Val 

330 


Leu 


val 


Leu 


Asp 


Arg 
345 


Gly 


Val 


Ser 


Met 


His 
360 


Leu 


Tyr 


Asn 


Thr 


Arg 
375 


val 


Arg 


Asp 


Val 


Lys 
390 


Ser 


Leu 


He 


Asp 


He 
405 
Gly


Leu 


val 


He 


Glu 
410 


Tyr 


Leu 


Met 


Gly 


Gly 
415 


Ala 


Tyr 


Arg 


Cys 


Asn 
420 


Tyr 


Thr 


Arg 


Lys 


Arg 
425 


Phe 


Arg 


Thr 


Leu 


Tyr 
430 


His 


Asn 


Leu 


Phe 


Gly 
435 


Pro 


Lys 


Arg 


Pro 


Lys 
440 


Ala 


Leu 


Lys 


Leu 


Leu 
445 


Gly 


Met 


Glu 


Asp 


Asp 
450 


He 


Pro 


Leu 


Arg 


Arg 
455 


Gly 


Arg 


Lys 


Thr 


Thr 
460 


Lys 


Lys 


Arg 


Glu 


Glu 
465 


Glu 


Val 


Asp 


He 


Asp 
470 


Leu 


Asp 


Asp 


Pro 


Glu 
475 


He 


Asn 


His 


Phe 


Pro 
480 


Phe 


Pro 


Phe 


His 


Glu 
485 


Leu 


Met 


Val 


Trp


Ala 
490 


Val 


Leu 


Met 


Lys


Arg
495 


Gin 


Lys


Met 


Ala 


Leu 
500 


Phe 


Phe 


Trp


Gin 


His 

505 


Gly 


Glu 


Glu 


Ala 


Met 
510 


Ala 




Ala 


Leu 


Val 
515 


Ala 


Cys 


Lys 


Leu 


Cys 
520 


Lys 


Ala 


Met 


Ala 


His 
525 


Glu 


Ala 


Ser 


Glu 


Asn 
530 


Asp 


Met 


Val 


Asp 


Asp 
535 


He 


Ser 


Gin 


Glu 


Leu 
540 


Asn 


His 


Asn 


Ser 


Arg 
545 


Asp 


Phe 


Gly


Gin 


Leu 
550 


Ala 


Val 


Glu 


Leu 


Leu 

555 


Asp 


Gin 


Ser 


Tyr


Lys 
560 


Gin 


Asp 


Glu 


Gin 


Leu 
565 


Ala 


Met 


Lys 


Leu 


Leu 
570 


Thr


Tyr


Glu 


Leu 


Lys


Asn 


Trp


Ser 


Asn 


Ala 


Thr 


Cys


Leu 


Gin 


Leu 








575 










580 










585 


Ala 


Val 


Ala 


Ala 


Lys
590 


His 


Arg


Asp 


Phe 


He 
595 


Ala 


His 


Thr 


Cys 


Ser 
600 


Gin 


Met 


Leu 


Leu 


Thr 
605 


Asp 


Met 


Trp


Met 


Gly
610 


Arg 


Leu 


Arg 


Met 


Arg 
615 




Asn 


Ser 


Gly


Leu 
620 


Lys 


Val 


He 


Leu 


Gly

625 


He 


Leu 


Leu 


Pro 


Pro 
630 


Ser 


He 


Leu 


Ser 


Leu 
635 


Glu 


Phe


Lys


Asn 


Lys 
640 


Asp 


Asp


Met 


Pro 


Tyr
645 


Met 


Ser 


Gin 


Ala 


Gin 
650 


Glu 


He 


His 


Leu 


Gin 

655 


Glu 


Lys


Glu 


Ala 


Glu 
660 


Glu 


Pro 


Glu 


Lys 


Pro 
665 


Thr 


Lys 


Glu 


Lys 


Glu 
670 


Glu 


Glu 


Asp 


Met 


Glu 
675 


Leu


He 


Ala 


Met 


Leu 
680 


Gly


Arg


Asn 


Asn 


Gly 
685 


Glu 


Ser 


Ser 


Arg


Lys
690 


Lys 


Asp 


Glu 


Glu 


Glu 


Val 


Gin 


Ser 


Glu 


His 


Arg 


Leu 


He 


Pro 


Leu 








695 










700 










705 


Gly


Arg


Lys


He 


Tyr
710 


Glu 


Phe 


Tyr


Asn 


Ala 
715 


Pro 


He 


Val 


Lys


Phe 
720 


Trp


Phe 


Tyr


Thr 


Leu 
725 


Ala 


Tyr


He 


Gly 


Tyr 
730 


Leu


Met 


Leu 


Phe 


Asn 

735 


Tyr 


He 


Val 


Leu 


Val 
740 


Lys 


Met 


Glu 


Arg 


Trp 
745 


Pro 


Pro 


Thr 


Gin 


Glu 
750 


Trp 


He 


Val 


He 


Ser 
755 


Tyr 


He 


Phe 


Thr 


Leu 
760 


Gly 


He 


Glu 


Lys 


Met 
765 


Arg 


Glu 


He 


Leu 


Met 
770 


Ser 


Glu 


Pro 


Gly 


Lys 
775 


Leu 


Leu 


Gin 


Lys 


Val 
780 


Lys


Val 


Trp 


Leu 


Gin 
785 


Glu 


His 


Trp 


Asn 


Val 
790 


Thr 


Asp 


Leu 


He 


Ala 
795 


He 


Leu 


Leu 


Phe 


Ser 
800 


Val 


Gly 


Met 


He 


Leu 
805 


Arg 


Leu 


Gin 


Asp 


Gin 
810 


Pro 


Phe 


Arg 


Ser 


Asp 
815 


Gly 


Arg 


Val 


He 


Tyr 
820 


Cys 


Val 


Asn 


He 


He 

825 


Tyr 


Trp 


Tyr 


He 


Arg 
830 


Leu 


Leu 


Asp 


He 


Phe 
835 


Gly 


Val 


Asn 


Lys 


Tyr 
840 


Leu 


Gly 


Pro 


Tyr 


Val 
845 


Met 


Met 


He 


Gly 


Lys 
850 


Met 


Met 


He 


Asp 


Met 
855 


Met 


Tyr 


Phe 


Val 


He 
860 


He 


Met 


Leu 


Val 


Val 
865 


Leu 


Met 


Ser 


Phe 


Gly 
870 


Val 


Ala 


Arg 


Gin 


Ala 


He 


Leu 


Phe 


Pro 


Asn 


Glu 


Glu 


Pro 


Ser 


Trp
875






Lys 


Leu 


Ala 


Lys Asn 
890 


He 


Phe 


Gly 


Glu 


Val 


Phe Ala 
905 


Asp 


Gin 


Glu 


Thr 


Arg 


Glu Asp 
920 


Gly 


Lys 


Thr 


Gly 


Ala 


Trp He 
935 


Val 


Pro 


Val 


Ala 


Asn 


He Leu 
950 


Leu 


Val 


Asn 


Thr 


Phe 


Phe Glu 
965 


Val 


Lys 


Phe 


Gin 


Arg 


Tyr Gin 
980 


Leu 


He 


Leu 


Pro 


Pro 


Pro Leu 

995 


He 


He 


Gin 


His 


Leu 


Cys Cys 
1010 


Arg 


Trp 


Glu 


Arg 


Asp 


Tyr Gly 
1025 


Leu 


Lys 


Lys 


Lys 


Val 


His Asp 
1040 


Phe 


Glu 


Arg 


Glu 


Lys 


Asp Asp 
1055 


Arg 


Phe 


Arg 


Val 


Thr 


Ser Glu 
1070 


Arg 


Val 


Glu 


Val 


Asn 


Glu Arg 
1085 


Glu 


His 


Val 


Asp 


He 


Arg Leu 
1100 


Ala 


Gin 


Ala 


Thr 


Ala 


Leu Glu 
1115 


Arg 


Leu 


Asn 


Lys 


He 


Arg Ser 
1130 


Arg 


Thr 


Tyr 


He 


Val 


Arg Gin 
1145 


Ser 


Ser 


Phe 


Lys 


Leu 


Gin Glu 
1160 


Ser 


He 


Ser 


Val 













880 






885 


Tyr 


Met Pro 


Tyr 


Trp Met 


He Tyr 




895 






900 


He 


Asp Pro 


Pro 


Cys Gly 


Gin Asn 




910 






915 


He 


He Gin 


Leu 


Pro Pro 


Cys Lys 




925 






930 


Ala 


He Met 


Ala 


Cys Tyr 


Leu Leu 




940 






945 


Asn 


Leu Leu 


He 


Ala Val 


Phe Asn 




955 






960 


Ser 


He Ser 


Asn 


Gin Val 


Trp Lys 




970 






975 


Met 


Thr Phe 


His 


Glu Arg 


Pro Val 




985 






990 


Phe 


ser His 


Met 


Thr Met 


He Phe 




1000 






1005 


Arg 


Lys His 


Glu 


Ser Asp 


Pro Asp 




1015 






1020 


Leu 


Phe He 


Thr 


Asp Asp 


Glu Leu 




1030 






1035 


Glu 


Gin Cys 


He 


Glu Glu 


Tyr Phe 




1045 






1050 


Asn 


Ser Ser


Asn 


Asp Glu 


Arg He 




1060 






1065 


Glu 


Asn Met 


Ser 


Met Arg 


Leu Glu 




1075 






1080 


Ser 


Met Lys 


Ala 


Ser Leu 


Gin Thr 




1090 






1095 


Leu 


Glu Asp 


Leu 


He Gly 


Arg Met 




1105 






1110 


Thr 


Gly Leu 


Glu 


Arg Ala 


Glu Ser 




1120 






1125 


Ser 


Ser Asp 


Cys 


Thr Asp 


Ala Ala 




1135 






1140 


Phe 


Asn Ser 


Gin 


Glu Gly 


Asn Thr 




1150 






1155 


Asp 


Pro Ala 


Glu 


His Pro 


Leu Tyr 




1165 






1170 
<400> 4



Met 


Glu 


He 


Ala 


Gly 


Glu Lys Lys 


Tyr 


Gin 


Tyr 


Tyr Ser 


Met Pro 


1 








5 






10 






15 


Ser 


Ser 


Leu 


Leu 


Arg 


Ser Leu Ala 


Gly 


Gly 


Arg 


His Pro 


Pro Arg 










20 






25 






30 


Val 


Gin 


Arg 


Gly 


Ala 


Leu Leu Glu 


He 


Thr 


Asn 


Ser Lys 


Arg Glu 










35 






40 






45 


Ala 


Thr 


Asn 


Val 


Arg 


Asn Asp Gin 


Glu 


Arg 


Gin 


Glu Thr 


Gin Ser 










50 






55 






60 


Ser 


He 


Val 


Val 


Ser 


Gly Val Ser 


Pro 


Asn 


Arg 


Gin Ala 


His Ser 










65 






70 






75 


Lys Tyr 


Gly 


Gin 


Phe 


Leu Leu Val 


Pro 


Ser 


Asn 


Leu Lys 


Arg Val 
80










85 










90 


Pro 


Phe 


Ser 


Ala 


Glu 
95 


Thr 


Val 


Leu 


Pro 


Leu 
100 


Ser 


Arg 


Pro 


Ser 


Val 
105 


Pro 


Asp 


Val 


Leu 


Ala 
110 


Thr 


Glu 


Gin 


Asp 


He 

115 


Gin 


Thr 


Glu 


Val 


Leu 
120 


Val 


His 


Leu 


Thr 


Gly 

125 


Gin 


Thr 


Pro 


Val 


Val 
130 


Ser 


Asp 


Trp 


Ala 


Ser 
135 


Val 


Asp 


Glu 


Pro 


Lys 
140 


Glu 


Lys 


His 


Glu 


Pro 
145 


He 


Ala 


His 


Leu 


Leu 
150 


Asp 


Gly 


Gin 


Asp 


Lys
155 


Ala 


Glu 


Gin 


val 


Leu 
160 


Pro 


Thr 


Leu 


Ser 


Cys 
165 


Thr 


Pro 


Glu 


Pro 


Met 
170 


Thr 


Met 


Ser 


Ser 


Pro 
175 


Leu 


Ser 


Gin 


Ala 


Lys 
180 


He 


Met 


Gin 


Thr 


Gly 
185 


Gly 


Gly 


Tyr 


val 


Asn 
190 


Trp 


Ala 


Phe 


Ser 


Glu 
195 


Gly 


Asp 


Glu 


Thr 


Gly 
200 


Val 


Phe 


Ser 


He 


Lys 
205 


Lys 


Lys 


Trp 


Gin 


Thr 
210 


Cys 


Leu 


Pro 


Ser 


Thr 
215 


Cys 


Asp 


Ser 


Asp 


Ser 
220 


Ser 


Arg 


Ser 


Glu 


Gin 
225 


His 


Gin 


Lys 


Gin 


Ala 
230 


Gin 


Asp 


Ser 


Ser 


Leu 
235 


Ser 


Asp 


Asn 


Ser 


Thr 
240 


Arg 


Ser 


Ala 


Gin 


Ser 


Ser 


Glu 


Cys 


Ser 


Glu 


Val 


Gly 


Pro 


Trp 


Leu 








245 










250 










255 


Gin 


Pro 


Asn 


Thr 


Ser 
260 


Phe 


Trp 


He 


Asn 


Pro 

265 


Leu 


Arg 


Arg 


Tyr 


Arg 
270 


Pro 


Phe


Ala 


Arg 


Ser 
275 


His 


Ser 


Phe 


Arg 


Phe 
280 


His 


Lys 


Glu 


Glu 


Lys 
285 


Leu 


Met 


Lys 


He 


Cys 

290 


Lys 


He 


Lys 


Asn 


Leu 

295 


Ser 


Gly 


Ser 


Ser 


Glu 

300 


lie 


Gly 


Gin 


Gly 


Ala 

305 


Trp 


Val 


Lys 


Ala 


Lys 
310 


Met 


Leu 


Thr 


Lys 


Asp 
315 


Arg 


Arg 


Leu 


Ser 


Lys 
320 


Lys 


Lys 


Lys 


Asn 


Thr 
325 


Gin 


Gly 


Leu 


Gin 


Val 

330 


Pro 


He 


He 


Thr 


Val 
335 


Asn 


Ala 


Cys 


Ser 


Gin 
340 


Ser 


Asp 


Gin 


Leu 


Asn 
345 


Pro 


Glu 


Pro 


Gly 


Glu 
350 


Asn 


Ser 


He 


Ser 


Glu 
355 


Glu 


Glu 


Tyr 


Ser 


Lys 
360 


Asn


Trp 


Phe 


Thr 


Val 
365 


Ser 


Lys 


Phe 


Ser 


His 
370 


Thr 


Gly 


Val 


Glu 


Pro 
375 


Tyr 


He 


His 


Gin 


Lys 
380 


Met 


Lys 


Thr 


Lys 


Glu 
385 


He 


Gly 


Gin 


Cys 


Ala 
390 


He 


Gin 


He 


Ser 


Asp 
395 


Tyr 


Leu 


Lys 


Gin 


Ser 
400 


Gin 


Glu 


Ser 


Ala 


Gin 
405 


Asp 


Leu 


Ser 


Lys 


Asn 
410 


Ser 


Leu 


Trp 


Asn 


Ser 
415 


Arg 


Ser 


Thr 


Asn 


Leu 
420 


Asn 


Arg 


Asn 


Ser 


Leu 
425 


Ser 


Ser 


Leu 


He 


Ser 
430 


Glu 


He 


Ser 


Ala 


Ser 
435 


Leu 


Lys 


Ser 


Pro 


Gin 
440 


Glu 


Pro 


His 


His 


His 
445 


Tyr 


Ser 


Pro 


Ser 


Leu 
450 


Leu 


Phe 


Ala 


Ala 


Gly 
455 


Glu 


Glu 


He 


Thr 


Val 
460 


Tyr 


Arg 


Leu 


Glu 


Glu 
465 


Ser 


Ser 


Pro 


Leu 


Asn 
470 


Leu 


Asp 


Lys 


Ser 


Met 
475 


Ser 


Ser 


Trp 


Ser 


Gin 
480 


Arg 


Gly 


Arg 


Ala 


Ala 
485 


Met 


He 


Gin 


Val 


Leu 
490 


Ser 


Arg 


Glu 


Glu 


Met 
495 


Asp 


Gly 


Gly 


Leu 


Arg 
500 


Lys 


Ala 


Met 


Arg 


Val 
505 


Val 


Ser 


Thr 


Trp 


Ser 
510 


Glu 


Asp 


Asp 


He 


Leu 
515 


Lys 


Pro 


Gly 


Gin 


Val 

520 


Phe 


He 


val 


Lys 


Ser 

525 


Phe 


Leu 


Pro 


Glu 


Val 
530 


val 


Arg 


Thr 


Trp 


His 
535 


Lys 


He 


Phe 


Gin 


Glu 
540 


Ser 


Thr 


Val 


Leu 


His 
545 


Leu 


Cys 


Leu 


Arg 


Glu 
550 


He 


Gin 


Gin 


Gin 


Arg 
555 






Ala 


Ala 


Gin 


Lys Leu 








560 


Thr 


He 


Pro 


Tyr Thr 








575 


Cys 


His 


Ser 


Ala Asn 






590 


Gly 


Glu 


Phe 


Arg Lys 








605 


Pro 


Thr 


Asn 


Thr Leu 








620 


Thr 


Tyr 


Glu 


Tyr Thr 








635 


Gly 


Val 


Gly 


Glu Asn 








650 


Val 


Lys


Gin 


Ser Arg 








665 


Glu 


Asp 


Ala 


He Arg 








680 


Cys


Cys


Arg


Lys Leu 








695 


Ser 


Pro 


Glu 


Arg He 








710 


Glu 


Ser 


Ala 


Glu Glu 








725 


Pro 


Glu 


Asp 


Asp Met 








740 



<210> 5 
<211> 577 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature
<223> Incyte ID No: 



<400> 5 








Met 


Gly 


Ser 


Arg 


His 


1 








5 


Phe 


Gly 


Arg 


Phe 


Gin 
20 


Asn


He 


Ser 


Cys 


Gly 
35 


val 


Thr 


Pro 


His 


His 
50 


Val 


Val 


Phe 


His 


Asn 

65 


Ala 


Leu 


Leu 


Ser 


Ser 
80 


Gin 


Asn 


Gly 


Glu 


He 
95 


Arg 


Glu 


Asn 


Thr 


Ser 
110 


Lys 


Glu 


Phe 


Pro 


Cys 
125 


Trp 


Lys 


Ser 


Thr 


Ala 
140 


Lys 


Trp 


Leu 


Ala 


Met 

155 


Leu 


Leu 


Gly 


Ser 


Val 
170 


Arg 


Arg 


Val 


Val 


Leu 
185 



He 


Tyr 


Thr 


Phe Asn 






565 


Pro 


Arg 


Phe 


Leu Glu 








580 


Gin 


Trp 


Leu 


Thr He 








595 


Tyr 


Asn 


Asn 


Asn Asn 








610 


Glu 


Glu 


Leu 


Met Leu 








625 


Arg 


Gly 


Glu 


Leu Leu 








640 


Leu 


Thr 


Asp


Pro Ser 






655 


Gly 


Met 


Val 


Phe Gly 








670 


Asn 


Phe 


He 


Ala Lys 








685 


Lys 


Leu 


Pro 


Asp Leu 








700 


Asn 


Ser 


Thr 


Phe Gly 








715 


Pro 


Pro 


Ala 


Arg Glu 








730 


Gin 


Leu 







4787433CD1 



Phe 


Glu 


Gly 


He Tyr 








10 


Arg 


Val 


Leu 


Tyr Phe 








25 


He 


His 


Tyr 


Leu Ala 








40 


Val 


Cys 


Arg 


Pro Pro 








55 


His 


Ser 


Asn 


Trp Ser 








70 


Gly 


Gin 


Lys 


Asp Tyr 








85 


Trp 


Glu 


Leu 


Ser Arg 








100 


Ser 


Leu 


Gly 


Tyr Glu 








115 


Val 


Asp 


Gly 


Tyr He 








130 


Val 


Thr 


Gin 


Trp Asn 








145 


Leu 


He 


Gin 


Pro Leu 








160 


Thr 


Phe 


Gly 


Tyr Phe 








175 


Trp 


Ala 


Thr 


Ser Ser 








190 



Gin 


Val Lys 


Pro 


Gin 








570 


Val 


Phe Leu 


He 


Tyr 








585 


Glu 


Lys Tyr 


Met 


Thr 








600 


Gly 


Asp Glu 


He 


Thr 








615 


Ala 


Phe Ser 


His 


Trp 








630 


Val 


Leu Asp 


Leu 


Gin 








645 


Val 


He Lys 


Pro 


Glu 








660 


Pro 


Ala Asn 


Leu 


Gly 








675 


His 


His Trp 


Asn 


Ser 








690 


Lys 


Arg Asn 


Asp 


Tyr 








705 


Leu 


Glu He 


Lys 


He 








720 


Thr 


Gly Arg 


Asn 


Ser 



735 



Asp 


His Val 


Gly 


His 








15 


He 


Cys Ala 


Phe 


Gin 








30 


Ser 


Val Phe 


Met 


Gly 








45 


Gly 


Asn Val 


Ser 


Gin 








60 


Leu 


Glu Asp 


Thr 


Gly 








75 


Val 


Thr Val 


Gin 


Leu 








90 


Cys 


Ser Arg 


Asn 


Lys 








105 


Tyr 


Thr Gly 


Ser 


Lys 








120 


Tyr 


Asp Gin 


Asn 


Thr 








135 


Leu 


val Cys 


Asp 


Arg 








150 


Phe 


Met Phe 


Gly 


val 








165 


Ser 


Asp Arg 


Leu 


Gly 








180 


Ser 


Met Phe 


Leu 


Phe 



195 
Gly


He 


Ala 


Ala Ala 


Phe 


Ala 


Val 






200 








Ala 


Arg 


Phe 


Phe Leu 
215 


Ala 


Met 


Val 


Gly 


Phe 


Val 


Tyr Val 
230 


Met 


Glu 


Phe 


Trp


Ala 


Ser 


Val His 


Leu 


His 


Ser 






245 








Leu 


Val 


Ala 


Leu Thr 
260 


Gly 


Tyr 


Leu 


Gin 


Met 


He 


Leu Ser 
275 


Thr 


Val 


Thr 


Trp 


Val 


Leu 


Pro Glu 
290 


Thr 


Pro 


Phe 


Tyr 


Glu 


Glu 


Ala Gin 
305 


Lys 


He 


Val 


Arg 


Ala 


Ser 


Ser Cys 
320 


Lys 


Leu 


Ser 


Gin 


Gly 


Pro 


Val Ser 

335 


Asn 


Ser 


Pro 


Leu 


Ser 


Tyr 


Leu Phe 
350 


Tyr 


Asn 


Trp


Thr 


Val 


Trp 


Leu He 
365 


Trp 


Phe 


Thr 


Phe 


Ser 


Leu 


Asn Ser 
380 


Val 


Asn 


Leu 


Leu 


Phe 


Leu 


Leu Gly 
395 


Val 


Val 


Glu 


Cys 


Ile


Ala 


Met Asp 
410 


Lys 


val 


Gly 


Ser 


Leu 


Phe 


Cys Ser 
425 


Ala 


Leu 


Ala 


Pro 


Gin 


Lys 


His Tyr 
440 


He 


Leu 


Gly 


Lys 


Phe 


Ala 


He Gly 
455 


Ala 


Ala 


Phe 


Ala 


Glu 


Leu 


Tyr Pro 
470 


Thr 


He 


Val 


Gly 


Ser 


Met 


Val Cys 
485 


Arg 


Leu 


Ala 


Val 


Asp 


Leu 


Ser Ser 
500 


He 


Trp 


He 


Gly 


Thr 


Met 


Ala Leu 
515 


Leu 


Ser 


Gly 


Glu 


Thr 


Leu 


Gly Lys 
530 


Arg 


Leu 


Ala 


Lys 


Leu 


Glu 


Ser Glu 
545 


Asn 


Glu 


Ser 


Thr 


Thr 


Asn 


Asn Ser 
560 


Gly 


Leu 


Glu 


Arg 


Asp 


Ser 


Gly Leu 
575 


Gly 


Glu 





<210> 6 
<211> 462 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7483598CD1 

<400> 6 



Asp Tyr 


Tyr Thr 


Phe Met Ala 




205 




210 


Ala 


Ser 


Gly Tyr 


Leu Val Val 




220 




225 


He Gly 


Met Lys 


Ser Arg Thr 




235 




240 


Phe 


Phe 


Ala val 


Gly Thr Leu 




250 




255 


Val 


Arg 


Thr Trp 


Trp Leu Tyr 




265 




270 


Val 


Pro 


Phe He 


Leu Cys Cys 




280 




285 


Trp 


Leu 


Leu Ser 


Glu Gly Arg 




295 




300 


Asp 


lie 


Met Ala 


Lys Trp Asn 




310 




315 


Glu 


Leu 


Leu Ser 


Leu Asp Leu 




325 




330 


Thr 


Glu 


Val Gin 


Lys His Asn 




340 




345 


Ser 


He 


Thr Lys 


Arg Thr Leu 




355 




360 


Gly 


Ser 


Leu Gly 


Phe Tyr Ser 




370 




375 


Gly Gly 


Asn Glu 


Tyr Leu Asn 




385 




390 


Ile


Pro 


Ala Tyr 


Thr Phe Val 




400 




405 


Arg Arg


Thr val 


Leu Ala Tyr 




415 




420 


Cys Gly 


Val val 


Met Val He 




430 




435 


Val 


Val 


Thr Ala 


Met Val Gly 




445 




450 


Gly Leu 


He Tyr 


Leu Tyr Thr 




460 




465 


Arg 


Ser 


Leu Ala 


Val Gly Ser 




475 




480 


Ser 


He 


Leu Ala 


Pro Phe Ser 




490 




495 


Phe 


He 


Pro Gin 


Leu Phe Val 




505 




510 


Val 


Leu 


Thr Leu 


Lys Leu Pro 




520 




525 


Thr 


Thr 


Trp Glu 


Glu Ala Ala 




535 




540 


Lys 


Ser 


Ser Lys 


Leu Leu Leu 




550 




555 


Lys 


Thr 


Glu Ala 


He Thr Pro 




565 




570 
Met
1 

Leu 


Glv 
ASD 


Tvr 

ASD 


Lys 


Thr 


Cys 


He 


Glv 


S&x 


Glv 


Phe 


* * v» 


lilX 


ABp 


It I Its 




nut. 


Asp 


trJL, \j 




'Put* 


xjie 


Ax a 




Lys 


VaX 


fne 






Aircf 






Leu 


OCX 


Leu 


x±e 


Ma ft* 

net 


AXa 


Aircf 


Asp 


Axa 


Trp 


vaX 


nee 


Ser* 




Ser 


Ser 


Tl A 


HxS 


nei. 


AXcl 


JLJiir 


Cys 




irne 








Cys 


Cys 


irne 


vax 


ASH 


Letl 


£>ex 


1 IIL. 


v«x 








r^l 11 

V7XU 


He 


He 


Pro 


11 IX 


XIXd 


otsx 


Ala 


Val 


Val 


Gin 


Asp 


Cys 


Asn 


Phe 


Sex 


Thr 


Gin 


Leu 



Gin 


Arg 


Gin 




5 




Arg 


Glu 


Thr 




20 




Gin 


Ser 


Ala 




35 




Gly 


He 


He 




50 




Leu 


Gly 


He 




65 




Ser 


Leu 


Val 




80 




Thr 


Tyr 


Gin 




95 




Leu 


Leu 


Leu 




110 




He 


Ser 


Tyr 




125 




Gin 


Arg 


He 




140 




His 


Phe 


He 




155 




Ser 


Leu 


Tyr 




170 




Ser 


Thr 


Gly 




185 




Ala 


He 


Ser 




200 




Val 


Phe 


Ala 




215 




Phe 


Ala 


Phe 




230 




Leu 


Glu 


Glu 




245 




Ser 


He 


Val 




260 




Gly 


Tyr 


Leu 




275 




Asn 


Tyr 


Cys 




290 




Tyr 


Gly 


val 




305 




Thr 


Arg 


Glu 




320 




Ser 


Val 


Phe 




335 




Thr 


Leu 


Val 




350 




Leu 


Asn 


Gly 




365 




Ser 


Ala 


Cys 




380 




Asp 


Lys 


He 




395 




Met 


Val 


Phe 




410 




Thr 


His 


Gly 




425 




Leu 


Thr 


Asn 




440 




Ser 


Thr 


Leu 




455 





Glu Pro Val 
Leu val Ser 
Ala Leu Phe 
Gly Leu Pro 
Leu Leu Leu 
Leu Leu He 
Ser Leu Val
Ser Val Leu 
Asn He He 
Pro Gly Val 
He Gly Leu 
Arg Asn lie 
Leu Thr Thr 
Leu Gly Pro 
Lys Pro Asn 
He Cys His 
Pro Thr Val 
He Ser Val 
Thr Phe Thr 
Arg Asn Asp 
Thr Val He 
Val He Ala 
His He Val 
Ser Leu Leu 
Val Leu Cys 
Tyr Leu Lys 
Met Ser Cys 
Gly Phe Val 
Gin Glu Met 
Thr Ser Glu 
Asn He Ser 



He 


Pro 


Pro 


10 






Glu 


His 


Glu 


25 






Asn 


Val 


val 


40 






Tyr 


Ser 


Met 


55 






Phe 


Trp 


Val 


70 






Lys 


Gly 


Gly 


85 






Asn 


Lys 


Thr 


100 






Gin 


Phe 


Leu 


115 






Ala 


Gly 


Asp 


130 






Asp 


Pro 


Glu 


145 






Ser 


Thr 


Val 


160 






Ala 


Lys 


Leu 


175 






Leu 


lie 


Leu 


190 






His 


He 


Pro 


205 






Ala 


He 


Gin 


220 






His 


Asn 


Ser 


235 






Ala 


Lys 


Trp 


250 






Phe 


He 


Cys 


265 






Gly 


Phe 


Thr 


280 






Asp 


Leu 


Val 


295 






Leu 


Thr 


Tyr 


310 






Asn 


Val 


Phe 


325 






Val 


Thr 


Val 


340 






He 


Asp 


Cys 


355 






Ala 


Thr 


Pro 


370 






Leu 


ser 


Glu 


385 






Val 


Met 


Leu 


400 






Met 


Ala 


He 


415 






Phe 


Tyr 


Cys 


430 






Ser 


His 


Val 


445 






He 


Phe 


Gin 


460 







Gin 


Arg 


Asp 






15 


Tyr 


Lys 


Glu 






30 


Asn 


Ser 


He 






45 


Lys 


Gin 


Ala 






60 


Ser 


Tyr 


Val 






75 


Ala 


Leu 


Ser 






90 


Phe 


Gly 


Phe 






105 


Tyr 


Pro 


Phe 






120 


Thr 


Leu 


Ser 






135 


Asn 


Val 


Phe 






150 


Thr 


Phe 


Thr 






165 


Gly 


Lys 


Val 






180 


Gly 


He 


Val 






195 


Lys 


Thr 


Glu 






210 


Ala 


val 


Gly 






225 


Phe 


Leu 


val 






240 


Ser 


Arg 


Leu 






255 


He 


Phe 


Phe 






270 


Gin 


Gly 


Asp 






285 


Thr 


Phe 


Gly 






300 


Pro 


Met


Glu 






315 


Phe 


Gly 


Gly 






330 


Met 


Val 


He 






345 


Leu 


Gly 


He 






360 


Leu 


He 


Phe 






375 


Glu 


Pro 


Arg 






390 


Pro 


He 


Gly 






405 


Thr 


Asn 


Thr 






420 


Phe 


Pro 


Asp 






435 


Gin 


Gin 


Thr 






450 
<210> 7
<211> 1018 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7484823CD1 

<400> 7 









^?xy 


Asp 


OCX 


vtxy 


Leu 


Arg 


Ser 


Pro 


Arg 


PI V 


Pro 


PI n 


1 








5 










10 










15 




civ 


i»J-jr 


Lys 


Cys 
20 


Ser 


Arg 


1^1 11 


jjya 


uxn 
25 


T.vre 


Arg 


Asn 


Ma^ 


Pill 
uXU 

30 


f 1 11 


Leu 


Lys 


Lys 


35 


vax 


VclJ- 




Asp 


Asp 
40 


xixs 


Lys 


Leu 


JL jir 


Leu 
45 


r51 n 


UJ.U 


Leu 


Ser 


Thr 
50 


Lys 


Tyr 


Scr 


vaX 


Asp 
55 


Leu 




Lys 


PI vr 

\jxy 


nXS 

60 




tlXS 




Arg 


Axa 

65 


Lys 


PI 11 
uxu 


xxe 


Leu 


illX 

70 


Arg 


Pi vr 


Pi vr 

i^xy 


Pro 


Asn 
75 


rpVi V 

jiiur 


va J. 


xnr 


Pro 


Pro 
80 


Pro 


inx 


rrVi V*- 

jLiur 


Pro 


Pi 11 

IjXU 

85 


Trp 


vax 


Lys 


troe 


Cys 
90 






Xieu 




vixy 
95 


uxy 


file 


<2oi" 


Leu 


Leu 
100 




Trp 






Axa 
105 


Tl A 


ijeu 


Cys 


Fxie 


vax 
110 


AX a 


Tyr 


Ser 




Pin 

1X5 


Tl a 

xxe 


Tyr 


Jrne 


Asn 


Pi 11 
19XU 

120 


^1 ft 


P3CO 


inr 


Lys 


Asp 
125 


Asn 


Leu 


Tyr 


Leu 


s>er 
130 


Tl 0 

xxe 


vax 


Leu 




Vxal 

vax 
135 


y&± 


vajL 


Tl & 

xxe 


vax 


Tiii 

140 


uxy 


Cys 




Ser 


Tyr 
145 


Tyr 


Pi Tl 


PI 11 


&1 SI 

AXa 


Lys 
150 






Lys 


Tl A 

xxe 


Mot' 

155 


Pi It 




l»IA 


Lys 


Asn 
160 


nec 


Val 

vax 


Pro 


tf!21n 
uXll 


vxn 
165 




Ij6U 


VaI 


XXe 


Arg 
170 


oxy 


oxy 




Lys 


Mot* 

175 


/21n 
vXIt 


Tl A 

xxe 


Asn 


Val 
vax 


(11 n 
vvxn 

180 






Val 


Leu 


185 


Asp 


Leu 


VcAX 


fZl n 

uxu 


Tie 

190 


Lys 


fil V 

laxy 


v»xy 


Aon 


Arg 
195 


vc** 


Pro 




Asp 


Leu 
200 






TTa 

XXV 




Ala 

AX CI 

205 


nln 


uxy 


Pvsi 
v«yo 


Lys 


Val 
va J. 

210 


Asp 


Asn


Ser 


Ser 


Leu 
215 


Thr 


Gly


Glu 


Ser 


Glu 
220 


Pro 


Gin 


Ser 


Arg 


Ser 
225 


Pro


Asp 


Phe 


Thr 


His 
230 


Glu 


Asn 


Pro 


Leu 


Glu 
235 


Thr 


Arg 


Asn 


He 


Cys 
240 


Phe 


Phe 


Ser 


Thr 


Asn 

245 


Cys 


Val 


Glu 


Gly


Thr 

250 


Ala 


Arg


Gly 


lie 


Val 

255 


lie 


Ala 


Thr 


Gly 


Asp 
260 


Ser 


Thr 


val 


Met 


Gly
265 


Arg 


He 


Ala 


Ser 


Leu 

270 


Thr 


Ser 


Gly 


Leu 


Ala 
275 


Val 


Gly 


Gin 


Thr 


Pro 
280 


He 


Ala 


Ala 


Glu 


He 
285 


Glu 


His 


Phe 


lie 


His 
290 


Leu 


lie 


Thr 


Val 


Val 

295 


Ala 


Val 


Phe 


Leu 


Gly 
300 


Val 


Thr 


Phe 


Phe 


Ala 
305 


Leu 


Ser 


Leu 


Leu 


Leu 
310 


Gly 


Tyr 


Gly 


Trp 


Leu 
315 


Glu 


Ala 


lie 


lie 


Phe
320 


Leu 


He 


Gly 


He 


He 
325 


Val 


Ala 


Asn 


Val 


Pro 
330 


Glu 


Gly 


Leu 


Leu 


Ala 

335 


Thr 


val 


Thr 


val 


Cys 
340 


Leu 


Thr 


Leu 


Thr 


Ala 
345 


Lys 


Arg 


Met 


Ala 


Arg 

350 


Lys 


Asn 


Cys 


Leu 


Val 
355 


Lys 


Asn 


Leu 


Glu 


Ala 
360 


Val 


Glu 


Thr 


Leu 


Gly 
365 


Ser 


Thr 


Ser 


Thr 


He 
370 


Cys 


Ser 


Asp 


Lys 


Thr 
375 


Gly 


Thr 


Leu 


Thr 


Gin 
380 


Asn 


Arg 


Met 


Thr 


Val 
385 


Ala 


His 


Met 


Trp 


Phe 
390 


Asp 


Met 


Thr 


Val 


Tyr 


Glu 


Ala 


Asp 


Thr 


Thr 


Glu 
Cys


He 


865 








870 


Pro His 


Leu Gly 


Gly 


Glu 


Leu 


880 








885 


Ala Thr 


Thr Trp 


Leu 


Phe 


Val 


895 








900 


Phe Ala 


Thr Leu 


Glu 


Ala 


Tyr 












Thr Lys 


Pro Asp 


Ser 


Leu 


Leu 


10 








15 


Leu Glu 


Leu Pro 


Lys 


Leu 


Leu 


25 








30 


Asn Phe 


Glu Met 


Gln


Pro 


Lys 


40 








45 


Leu He 


Lys Ala 


Ala 


Met 


Thr 


55 








60 


Gly Val 


Ser Thr 


Gly 


Val 


He 


70 








75 


Asp His 


Ser Ser 


Lys 


Ser 


Arg 


85 








90 


Ala Pro 


Trp Gly 


He 


Val 


Glu 


100 








105 


Asp val 


Thr Arg 


Val 


Tyr 


Gin 


115 








120 


Leu Ser 


Val Leu 


Asn 


Asn 


Ser 


130 








135 


Asn Gly


Thr Leu 


Gly 


Lys 


Tyr 


145 








150 


Gin Leu 


Glu Lys 


His 


He 


Ser 


160 








165 


Gly Gin 


Gly Val 


Pro 


val 


Val 


175 








180 


Asn Val 


He Ser 


He 


Val 


Leu 


190 








195 


Val Pro 


Val Val 


Val 


Cys 


Asp 


205 








210 


Leu Ala 


Phe Gly 


His 


Lys 


Tyr 


220 








225 


Glu Ser 


Leu Arg 


Asp 


Gin 


Leu 


235 








240 


Thr Tyr 


Thr Arg 


Thr 


Gin 


Ala 
245 250 255



Gin 


His 


Leu 


Phe 


He 
260 


He 


Leu 


Met 


Glu 


Cys 
265 


Met 


Lys 


Lys 


Lys 


Glu 
270 


Leu 


He 


Thr 


Val 


Phe 

275 


Arg 


Met 


Gly 


Ser 


Glu 
280 


Gly 


His 


Gin 


Asp 


He 

285 


Asp 


Leu 


Ala 


He 


Leu 
290 


Thr 


Ala 


Leu 


Leu 


Lys 
295 


Gly 


Ala 


Asn 


Ala 


Ser 
300 


Ala 


Pro 


Asp 


Gin 


Leu 
305 


Ser 


Leu 


Ala 


Leu 


Ala 

310 


Trp 


Asn 


Arg 


Val 


Asp 
315 


He 


Ala 


Arg 


Ser 


Gin 
320 


He 


Phe 


He 


Tyr 


Gly 
325 


Gin 


Gin 


Trp 


Pro 


Val 
330 


Gly 


Ser 


Leu 


Glu 


Gin 
335 


Ala 


Met 


Leu 


Asp 


Ala 
340 


Leu 


Val 


Leu 


Asp 


Arg 
345 


Val 


Asp 


Phe 


Val 


Lys 
350 


Leu 


Leu 


He 


Glu 


Asn 
355 


Gly 


Val 


Ser 


Met 


His 
360 


Arg 


Phe 


Leu 


Thr 


He 
365 


Ser 


Arg


Leu 


Glu 


Glu 
370 


Leu 


Tyr 


Asn 


Thr 


Arg 
375 


His 


Gly 


Pro


Ser 


Asn 
380 


Thr 


Leu 


Tyr 


His 


Leu 
385 


Val 


Arg 


Asp 


Val 


Lys 
390 


Lys 


Gly 


Asn 


Leu 


Pro 
395 


Pro 


Asp 


Tyr 


Arg 


He 
400 


Ser 


Leu 


He 


Asp 


He 
405 


Gly 


Leu 


Val 


He 


Glu 
410 


Tyr 


Leu 


Met 


Gly 


Gly 

415 


Ala 


Tyr 


Arg 


Cys 


Asn 
420 


Tyr 


Thr 


Arg 


Lys 


Arg 
425 


Phe 


Arg 


Thr 


Leu 


Tyr 
430 


His 


Asn 


Leu 


Phe 


Gly 
435 


Pro 


Lys 


Arg 


Asp 


Asp 
440 


He 


Pro 


Leu 


Arg 


Arg 
445 


Gly 


Arg 


Lys 


Thr 


Thr 
450 


Lys 


Lys 


Arg 


Glu 


Glu 

455 


Glu 


Val 


Asp 


He 


Asp 
460 


Leu 


Asp 


Asp 


Pro 


Glu 
465 


He 


Asn 


His 


Phe 


Pro 

470 


Phe 


Pro 


Phe 


His 


Glu 

475 


Leu 


Met 


Val


Trp 


Ala 
480 


Val 


Leu 


Met 


Lys 


Arg 
485 


Gin 


Lys 


Met 


Ala 


Leu 
490 


Phe 


Phe 


Trp 


Gin 


His 
495 


Gly 


Glu 


Glu 


Ala 


Met 
500 


Ala 


Lys 


Ala 


Leu 


Val 
505 


Ala 


Cys 


Lys 


Leu 


Cys 
510 


Lys 


Ala 


Met 


Ala 


His 
515 


Glu 


Ala 


Ser 


Glu 


Asn 
520 


Asp 


Met 


Val


Asp 


Asp 
525 


He 


Ser 


Gin 


Glu 


Leu 
530 


Asn 


His 


Asn 


Ser 


Arg 
535 


Asp 


Phe 


Gly 


Gin 


Leu 
540 


Ala 


Val 


Glu 


Leu 


Leu 
545 


Asp 


Gin 


Ser 


Tyr 


Lys 

550 


Gin 


Asp 


Glu 


Gin 


Leu 

555 


Ala 


Met 


Lys 


Leu 


Leu 
560 


Thr 


Tyr 


Glu 


Leu 


Lys 
565 


Asn 


Trp 


Ser 


Asn 


Ala 
570 


Thr 


Cys 


Leu 


Gin 


Leu 

575 


Ala 


Val 


Ala 


Ala 


Lys 
580 


His 


Arg 


Asp 


Phe 


He 
585 


Ala 


His 


Thr 


Cys 


Ser 
590 


Gin 


Met 


Leu 


Leu 


Thr 
595 


Asp 


Met 


Trp 


Met


Gly 
600 


Arg 


Leu 


Arg 


Met 


Arg 
605 


Lys 


Asn 


Ser 


Gly 


Leu 
610 


Lys 


Val 


He 


Leu 


Gly 
615 


He 


Leu 


Leu 


Pro 


Pro 
620 


Thr 


He 


Leu 


Phe 


Leu 
625 


Glu 


Phe 


Arg 


Thr 


Tyr 
630 


Asp 


Asp 


Phe 


Ser 


Tyr 
635 


Gin 


Thr 


Ser 


Lys 


Glu 
640 


Asn 


Glu 


Asp 


Gly 


Lys 
645 


Glu 


Lys 


Glu 


Glu 


Glu 
650 


Asn 


Thr 


Asp 


Ala 


Asn 
655 


Ala 


Asp 


Ala 


Gly 


Ser
660 


Arg 


Lys 


Gly 


Asp 


Glu 

665 


Glu 


Asn 


Glu 


His 


Lys 
670 




Gin 


Arg 


Ser 


He 
675 


Pro 


He 


Gly 


Thr 


Lys 
680 


He 


Cys 


Glu 


Phe 


Tyr 
685 


Asn 


Ala 


Pro 


He 


Val 
690 


Lys 


Phe 


Trp 


Phe 


Phe 
695 


Gin 


He 


Ser 


Tyr 


Leu 
700 


Gly 


Tyr 


Leu 


Leu 


Leu 
705 


Phe 


Asn 


Tyr 


Val


He 


Leu 


Val 


Arg 


Met 


Asp 


Gly 


Trp 


Pro 


Ser 


Leu 



710 715 720 






Gin 


Glu 


Trp 


Ile Val








725 


Lys 


Ile


Arg 


Glu Val 






740 


Ala 


Arg 


Lys 


Ser Lys 








755 


Asp 


Leu 


Val 


Ala Ile








770 


Leu 


Gin 


Asn 


Gin Pro 








785 


Val 


Asp 


Ile


Ile Phe








800 


Val 


Asn 


Lys 


Tyr Leu 








815 


Val 


Ser 


Ser 


Gly Ile








830 


Met 


Ser 


Phe 


Gly Val








845 


Lys 


Pro 


Ser 


Trp Lys 








860 


Trp 


Met 


He 


Tyr Gly 








875 


Ser 


Phe 


Phe 


Leu Ser 








890 


Glu 


Gly 


Lys 


Arg Leu 








905 


Pro 


Ala 


Leu 


Met Ala 








920 


Val 


Asn 


Leu 


Leu He 








935 


Lys 


Ser 


He 


Ser Asn 








950 


Ile


Met 


Thr 


Phe His 








965 


Ile


Leu 


Ser 


His He 








980 


Cys 


Arg 


Lys 


Lys Arg 








995 


Leu 


Ser 


Met 


Phe Leu 








1010 


Phe 


Glu 


Glu 


Gin Cys 








1025 


Glu 


Gin 


Gin 


Ser Ser 








1040 


Arg 


Val 


Glu 


Asn Met 








1055 


Glu 


Thr 


Phe 


Met Lys 








1070 


Ala 


Gin 


Leu 


Glu Glu 








1085 


Asn 


Leu 


Ala 


Gly He 








1100 


Arg 


Ala 


Ser 


Ser Glu 








1115 


Ser 


Ile


Asn 


Ser Ala 








1130 


Asn 


Gly 


Glu 


Glu Leu 








1145 


Pro 


Gly 


Thr 


Gly Val 








1160 


Glu 


Glu 


Lys 


Asp Val 








1175 


Ser 


Leu 


His 


Leu Ser 



He 


Ser 


Tyr 


He Val 








730 


Ala 


Thr 


Pro 


Lys Ala 








745 


Phe 


Gly 


Phe 


Gin Glu 








760 


Ser 


Thr 


Phe 


Met He 








775 


Tyr 


Met 


Gly 


Tyr Gly 








790 


Trp 


Tyr 


He 


Arg Val 








805 


Gly 


Pro 


Tyr 


Val Met 








820 


Leu 


Trp 


Val 


Val He 








835 


Ala 


Arg 


Gin 


Ala He 








850 


Leu 


Ala 


Arg 


Asn He 








865 


Glu 


Val


Phe 


Ala Asp 








880 


Ala 


Pro 


Cys 


Gly Glu 








895 


Pro 


Pro 


Cys 


He Pro 








910 


Cys 


Tyr 


Leu 


Leu Val 








925 


Ala 


Val 


Phe 


Ser Asn 








940 


Gin 


Val 


Trp 


Lys Phe 








955 


Asp 


Arg 


Pro 


Val Leu 








970 


Tyr 


He 


He 


He Met 








985 


Glu 


Gly 


Asp 


Gin Glu 








1000 


Ser 


Asp 


Glu 


Glu Leu 








1015 


Val 


Gin 


Glu 


His Phe 








1030 


Ser 


Asp 


Glu 


Arg He 








1045 


Ser 


Met 


Arg 


Leu Glu 








1060 


Thr 


Ser 


Leu 


Gin Thr 








1075 


Leu 


Ser 


Asn 


Arg Met 








1090 


Asp 


Arg 


Ser 


Asp Leu 








1105 


Cys 


Glu 


Ala 


Thr Tyr 








1120 


Asp 


Gly 


Tyr 


Ser Leu 








1135 


Leu 


Phe 


Glu 


Asp Thr 








1150 


Arg 


Lys 


Lys 


Thr Cys 








1165 


Lys 


Thr 


His 


Leu Val








1180 


Leu 


Gly 


Thr 


Ser Thr 



Ser 


Leu 


Ala 


Leu Glu 








735 


Ser 


Pro 


Ser 


Pro Leu 








750 


Tyr 


Trp 


Asn 


He Thr 








765 


Gly 


Ala 


He 


Leu Arg 








780 


Arg 


Val 


He 


Tyr Cys 








795 


Leu 


Asp 


He 


Phe Gly 








810 


Met 


He 


Gly 


Lys Met 








825 


Met 


Leu 


Val 


Val Leu 








840 


Leu 


His 


Pro 


Glu Glu 








855 


Phe 


Tyr 


Met 


Pro Tyr 








870 


Gin 


He 


Asp 


Arg Lys 








885 


Asn 


Leu 


Tyr 


Asp Glu 








900 


Gly 


Ala 


Trp 


Leu Thr 








915 


Ala 


Asn 


He 


Leu Leu 








930 


Thr 


Phe 


Phe 


Glu Val 








945 


Gin 


Arg 


Tyr 


Gin Leu 








960 


Pro 


Pro 


Pro 


Met He 








975 


Arg 


Leu 


Ser 


Gly Arg 








990 


Glu 


Arg 


Asp 


Arg Gly 








1005 


Lys 


Arg 


Leu 


His Glu 








1020 


Arg 


Glu 


Lys 


Glu Asp 








1035 


Arg 


Val


Thr 


Ser Glu 








1050 


Glu 


He 


Asn 


Glu Arg 








1065 


Val 


Asp 


Leu 


Arg Leu 








1080 


Val 


Asn 


Ala 


Leu Glu 








1095 


He 


Gin 


Ala 


Arg Ser 








1110 


Leu 


Leu 


Arg 


Gin Ser 








1125 


Tyr 


Arg 


Tyr 


His Phe 








1140 


Ser 


Leu 


Ser 


Thr Ser 








1155 


Ser 


Phe 


Arg 


He Lys 








1170 


Pro 


Glu 


Cys 


Gin Asn 








1185 


Ser 


Ala 


Thr 


Pro Asp 
1190



Gly 


Ser 


His 


Leu Ala 
1205 


Val 


Asp 


Asp 


Lys 


Leu 


Gly 


Pro Asp 
1220 


He 


Gly 


He 


Gin 


Thr 


Asp 


Ser Lys 

1235 


Lys 


Glu 


Glu 


Lys 


Thr 


Asp 


Val He 
1250 


His 


Gly 


Gin 


Thr 


Gin 


Leu 


Thr Val
1265 


Glu 


Thr 


Thr 


Tyr 


Pro 


Leu 


Glu Glu 
1280 


Thr 


Lys 


He 


Thr 


He 


Asn 


Ala Cys 
1295 


Lys 


Thr 


Met 


Ser 


Arg 


Gly 


Arg Lys 
1310


Leu 


Val 


Gly 


Tyr 


Ser 


Ser 


He Thr 

1325 


Asp 


Gin 


Gin 


Gin 


Val 


Gin 


Lys He 
1340 


Thr 


Arg 


Ser 


He 


Val


Ser 


Glu Ala 
1355 


Ala 


Val 


Gin 


Ala 


Asp 


Met 


Gin Asp 
1370 


Glu 


His 


His 


He 


Pro 


Arg 


Leu Ser 
1385 


Leu 


Thr 


He 


Asn 


Leu 


Leu 


Ser Val 
1400 


Lys 


Pro 


Asp 


Leu 


Arg 


Ser 


Lys Ser 
1415 


Leu 


His 


Gly 


Ile


Gin 


Gly 


Lys Leu 
1430 


Asp 


Arg 


Ser 


Ser 


Leu 


Val 


He Val 

1445 


Ser 


Gly 


Met 


Lys 


Lys 


Glu 


Lys Ala 


Ser 


Thr 


Glu 
Met


Thr 


Glu 


Gin 


Ala 


He 


Ser 


Phe 


1 








5 








Gly 


He 


Thr 


Ala 


Ala 
20 


He 


Ser 


Lys 


Arg 


Val 


Gin 


Leu 


Leu 
35 


Leu 


Gin 


Met 


Ala 


Ala 


Ala 


Lys 


Gin 
50 


Cys 


Lys 


Gly 


He 


Pro 


Lys 


Asp 


Gin 
65 


Gly 


Val 


Leu 


Ala 


Asn 


Val 


He 


Arg 
80 


Tyr 


Ser 


Pro 


Phe 


Lys 


Asp 


Lys 


Tyr 
95 


Lys 


Gin 


He 


His 


Thr 


Gin 


Phe 


Cys 


Arg 


Tyr 


Phe 



1195 






1200 


Leu Lys 


Asn Ala 


Glu 


Glu Ser 


1210 






1215 


Ser Lys 


Glu Asp 


Asp 


Glu Arg 


1225 






1230 


Thr He 


Ser Pro 


Ser 


Leu Asn 


1240 






1245 


Asp Lys 


Ser Asp 


Val 


Gin Asn 


1255 






1260 


Asn He 


Glu Gly 


Thr 


He Ser 


1270 






1275 


Thr Arg 


Tyr Phe 


Pro 


Asp Glu 


1285 






1290 


Lys Ser 


T^g Ser 


Phe 


Val Tyr 


1300 






1305 


Gly Val 


Asn Gin 


Asp 


Val Glu 


1315 






1320 


Leu Thr 


Thr Glu 


Trp 


Gin Cys 


1330 






1335 


His Ser


Thr Asp 


He 


Pro Tyr 


1345 






1350 


Ala Glu 


His Lys 


Glu 


Gin Phe 


1360 






1365 


Val Ala 


Glu Ala 


He 


Pro Arg 


1375 






1380 


Thr Asp 


Arg Asn 


Gly 


Met Glu 


1390 






1395 


Gin Thr 


Leu Gly 


Phe 


Pro Ser 


1405 






1410 


His Pro 


Arg Asn 


Val 


Lys Ser 


1420 






1425 


Gly His 


Ala Ser 


Ser 


Val Ser 


1435 






1440 


Thr Ala 


Glu Glu 


Lys 


Lys Val 


1450 






1455 
Gys
Ala Lys


Asp Phe 


Leu 


Ala Gly 


10 






15 


Thr Ala 


Val Ala 


Ser 


He Lys 


25 






30 


Gin His 


Ala Ser 


Met 


Pro Met 


40 






45 


Ile Val


Asp Cys 


He 


Val Arg 


55 






60 


Ser Phe 


Trp Arg 


Gly 


Asn Leu 


70 






75 


Thr Gin 


Ala Leu 


Asn 


Phe Ala 


85 






90 


Phe Leu 


Ala Gly 


Val 


Asp Lys 


100 






105 


Ala Gly 


Asn Leu 


Ala 


Ser Gly 
110



Gly Thr 


Ala Val 


Val


Tyr 


Pro 


Leu 






125 








Ala Ala 


Asp Val 


Gly 


Lys 


Ser 


Gly 






140 








Leu Gly 


Asp Cys 


Leu 


Val 


Lys 


Ile






155 








Gly Leu 


Tyr Gin 


Gly 


Phe 


Ser 


Val 






170 








Tyr Gin 


Ala Ala 


Tyr 


Phe 


Arg 


Val 






185 








Phe Pro 


Asp Pro 


Lys 


Asn 


Thr 


His 






200 








Ala Gin 


Thr Val


Thr 


Ala 


Val 


Ala 



215 

<210> 12 
<211> 461 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7483595CD1 

<400> 12 



Met 


Glu 


Thr 


Leu 


Leu 


Phe 


Leu 


Glu 


1 








5 








Gly 


Leu 


Lys 


Thr 


Phe 
20 


Tyr 


Pro 


Thr 


Leu 


Ile


Met 


Gly 


Val 


Leu 


Gly 


Leu 










35 






Pro 


Glu 


Ser 


Ala 


Met 


Ser 


Ala 


Tyr 










50 






Ile


Met 


Leu 


Ser 


Gin 
65 


Leu 


Thr 


Phe 


Phe 


His 


Ala 


Gly 


Pro 
80 


He 


Ser 


Phe 


Cys 


Val


Ala 


Leu 


Pro 

95 


Lys 


Ala 


Asn 


Leu 


Thr 


Val 


Val 


Val 
110 


Ala 


Leu 


Arg 


Ser 


Phe 


Asn 


Gin 


Tyr 
125 


Pro 


He 


Glu 


He 


He 


Leu 


Gin 


Ala 
140 


Phe 


Ser 


Leu 


Leu 


He 


Phe 


Leu 


Gly 
155 


Lys 


Lys 


Hg 


Val 


Asn 


Ser 


Asn 


Gin 
170 


Asp 


Leu 


He 


Val 


Ser 


Ser 


Phe 


Phe 
185 


Arg 


Ser 


Cys 


Arg 


Thr 


He 


He 


Gin 
200 


Asp 


Lys 


Ser 


He 


Ser 


Gly 


Arg 


Arg 
215 


Arg 


Ala 


Lys 


Pro 


Asn 


Thr 


Asn 


He 
230 


Tyr 


Arg 


Ser 


He 


Thr 


He 


Pro 


Gly 
245 


Val 


Lys 


He 


Thr 


Phe 


Val 


Asn 


Val 
260 


Tyr 


Tyr 


Leu 


Val 


Asp 


Met 


Val 


Lys 


Val 


Pro 


Leu 



115 








120 


Asp Phe 


Thr 


Arg 


Thr Arg 


Leu 


130 








135 


Thr Glu 


Arg 


Glu 


Phe Arg 


Gly 


145 








150 


Ser Lys 


Ser 


Asp 


Gly He 


Arg 


160 








165 


Ser Val 


Gin 


Ala 


He He 


He 


175 








180 


Tyr Asp 


Thr 


Ala 


Asn Gly 


Met 


190 








195 


He Leu 


Val 


Ser 


Trp Met 


Thr 


205 








210 


Gly Val 


Leu 


Ser 






220 










Ser Ala 


He 


Gly 


Ala He 


Ala 


10 








15 


Leu Val 


Phe 


Ser 


Ser Leu 


Gin 


25 








30 


Gly Phe 


He 


Ala 


Thr Tyr 


Leu 


40 








45 


Leu Ala 


Ala 


Val 


Ala Leu 


His 


55 








60 


He Phe 


Gly 


He 


Met He 


Ser


70 








75 


Phe Tyr 


Asp 


He 


He Asn 


Tyr 


85 








90 


Ser Thr 


Ser 


He 


Leu Val 


Phe 


100 








105 


He Asn 


Lys 


Cys 


He Arg 


He 


115 








120 


Phe Pro 


Met 


Glu 


Leu Phe 


Leu 


130 








135 


Ser Leu 


Val 


Ser 


Ser Phe 


Leu 


145 








150 


Ala Ser 


Leu 


His 


Asn Tyr 


Ser 


160 








165 


Ala He 


Gly 


Leu 


Cys Asn 


Val 


175 








180 


Val Phe 


Thr 


Gly 


Ala He 


Ala 


190 








195 


Gly Gly 


Ser 


Thr 


Thr Val


Cys 


205 








210 


He Leu 


Leu 


Leu 


Gly Gin 


He 


220 








225 


He Asn 


Asp 


Tyr 


Arg Glu 


He 


235 








240 


Phe Gin 


Cys 


Cys 


Ser Ser 


He 


250 








255 


Lys His 


Lys 


Leu 


Leu Lys 


Glu 


265 








270 


Lys Glu 


Glu 


Glu 


He Phe 


Ser 
275




Leu 


Phe 


Asn 


Ser Ser 


Asp Thr Asn 








290 




Arg 


Cys 


Phe 


Cys Asn 


Cys Asp Asp 








305 




Leu 


Tyr 


Thr 


Glu Arg 


Phe Glu Asn 








320 




Ser 


Val 


Asn 


Leu Ile


His Cys Ser 








335 




Ser 


Gin 


Thr 


Ala Ser 


Glu Asp Gin 








350 




Val 


Ser 


Gin 


Lys Asn 


Gin Gly Gin 








365 




Val 


Trp 


Leu 


Pro Asn 


Asn Ser Ser 








380 




Pro 


Asp 


Val 


Ala Glu 


Ser Gin Gly 








395 




Ser 


Asp 


Ala 


Ser Leu 


Leu Pro Ser 








410 




Phe 


Ser 


Met 


Val His 


Tyr Val Asp 








425 




Arg 


Gin 


Val 


Ser Thr 


Glu Glu Ala 








440 




Leu 


Leu 


Pro 


Ser Gin 


Pro His Pro 








455 
Met


Ser 


Ile


Val


Arg 


Leu 


Ser 


Val 
Lys


Val 


Thr 


Gly


Thr 
20 


Lys 


Met 


Gin 


Ile


Arg 


Leu 


Val 


Leu 
35 


Asp 


Pro 


His 


Gin 


Lys 


Thr 


Tyr 


Phe 
50 


Ala 


His 


Asn 


Gly 


Asp 


He 


Val 


Leu 
65 


Leu 


Lys 


Ala 


His 


Val 


Lys 


His 


Glu 
80 


Leu 


Ala 


Glu 


Leu 


Val 
Lys


Glu 


Leu 


Ser 


Glu 
710 


Ala 


Arg 


He 


Arg 


Thr 
715 


Val 


Met 


He 


Thr 


Gly 
720 


Asp 


Asn 


Leu 


Gin 


Thr 
725 


Ala 


He 


Thr 


Val 


Ala 
730 


Lys 


Asn 


Ser 


Glu 


Met 
735 


Ile


Pro 


Pro 


Gly 


Ser 
740 


Gin 


Val 


He 


He 


Val 

745 


Glu 


Ala 


Asp 


Glu 


Pro 

750 


Glu 


Glu 


Phe 


Val 


Pro 

755 


Ala 


Ser 


Val 


Thr 


Trp 
760 


Gin 


Leu 


Val 


Glu 


Asn 
765 


Gin 


Glu 


Thr 


Gly 


Pro 
770 


Gly 


Lys 


Lys 


Glu 


He 
775 


Tyr 


Met 


His 


Thr 


Gly 
780 


Asn 


Ser 


Ser 


Thr 


Pro 
785 


Arg 


Gly 


Glu 


Gly 


Gly 
790 


Ser 


Cys 


Tyr 


His 


Phe 
795 


Ala 


Met


Ser 


Gly 


Lys 
800 


Ser 


Tyr 


Gin 


Val 


He 
805 


Phe 


Gin 


His 


Phe 


Asn 

810 


Ser 


Leu 


Leu 


Pro 


Lys 
815 


He 


Leu 


Val 


Asn 


Gly 
820 


Thr 


Val 


Phe 


Ala 


Arg 
825 


Met 


Ser 


Pro 


Gly 


Gin 
830 


Lys 


Ser 


Ser 


Leu 


He 
835 


Glu 


Glu 


Phe 


Gin 


Lys 
840 


Leu 


Asn 


Ala 


Cys 


Thr 
845 


Val


Gin 


Asn 


Glu 


Ser 
850 


He 


Ser 


Glu 


Leu 


Thr 
855 


Met 


Ser 


Pro 


Thr 


Ala 
860 


Pro 


Glu 


Lys 


Met 


Glu 
865 


Ser 


Asn 


Ser 


Thr 


Phe 
870 


Thr


Ser 


Phe 


Glu 


Asn 
875 


Thr 


Thr 


Val


Trp 


Phe 
880 


Leu 


Gly 


Thr 


He 


Asn 
885 


Cys 


He 


Thr 


Val 


Ala 
890 


Leu 


Val 


Phe 


Ser 


Lys 
895 


Gly 


Lys 


Pro 


Phe 


Arg 
900 


Gin 


Pro 


Thr 


Tyr 


Thr 
905 


Asn 


Tyr 


He 


Phe 


Val 
910 


Leu 


Val 


Leu 


He 


He 
915 


Gin 


Leu 


Gly 


Val 


Cys 
920 


Leu 


Phe 


He 


Leu 


Phe 

925 


Ala 


Asp 


He 


Pro 


Glu 
930 


Leu 


Tyr 


Arg 


Arg 


Leu 
935 


Asp 


Leu 


Leu 


Cys 


Thr 
940 


Pro 


Val


Leu 


Trp 


Arg 
945 


Ala 


Ser 


He 


Val 


He 
950 


Met 


Leu 


Ser 


Leu 


Asn 
955 


Phe 


He 


Val 


Ser 


Leu 
960 
Val Ala Glu


Glu Ala 


Val 


He 


Glu 


Asn Arg 


Ala Leu Trp Met Met 




965 










970 




975 


He Lys Arg 


Cys Phe 


Gly Tyr Gin 


Ser Lys 


Ser Gin Tyr Arg He 














985 




990 


Trp Gin Arg 


Asp Leu 


Ala Asn Asp 


Pro 


Ser 


Trp Pro Pro Leu 


Asn 




995 








1000 


1005 


Gin Thr Ser 


His Ser 


Asp Met Pro 


Glu Cys 


Gly Arg Gly Val Ser 




1010 








1015 


1020 


Tyr Ser Asn 


Pro Val 


Phe 


Glu 


Ser 


Asn 


Glu 


Glu Gin Leu 






1025 








1030 
Met Asp Cys


Tyr Arg 


Thr 


Ser 


Leu 


Ser 


Ser 


ii>^Jm JLX9 kjft 


pro 


1 


5 










10 


15


Thr Val He 


Leu Cys 


Leu 


Phe 


Gly 


Phe 


Phe 


S^r* Mp>t- Met- Ai"CT 
fici,. incst. 






20 










25 




30


Ser Glu Pro 


Phe Leu 


He 


Pro 


Tyr 


Leu 


Ser 


\r l^v^rk Ann Tatq 
UXj^ irxu Aoj^ Xjjfo 


Asn 




35 










40 






Leu Thr Ser 


Ala Glu


He 


Thr 


Asn 


Glu 


He 


IrilC Ir^U VClJL A.L^ 


rnV. 




50 










55 


fin 


Tyr Ser Tyr 


Leu Val 


Leu 


Leu 


Leu 


Pro 


Val 


ITllcr VclX IjtSU AiXX^ 






65 










70 




75 


Tyr Val Arg 


Tyr Lys 


Pro 


Val 


He 


He 


Leu 


\^±n \9J.y xj-ts oi^-ir 






80 










85 




90 


He He Thr 


Trp Leu 


Leu 


Leu 


Leu 


Phe 


Gly 


Gln Gly Val Lys






95 










100 




105 



Met Gin Val 


Val Glu 


Phe 


Phe 


Tyr 


Gly 


Met 


Val Thr Ala Ala


V3XU 




110 










115 




120 


Val Ala Tyr 


Tyr Ala 


Tyr 


He 


Tyr 


Ser 


Val 


Val Ser Pro Glu 


His 




125 










130 




135 


Tyr Gin Arg 


Val Ser 


Gly 


Tyr 


Cys 


Arg 


Ser 


Val Thr Leu Ala 


Ala 




140 










145 




150 


Tyr Thr Ala 


Gly Ser 


Val 


Leu 


Ala 


Gin 


Leu 


Leu Val Ser Leu


Ala




155 










160 




165 


Asn Met Ser 


Tyr Phe 


Tyr


Leu 


Asn 


Val 


He 


Ser Leu Ala Ser 


Val 




170 










175 




180


Ser Val Ala 


Phe Leu 


Phe 


Ser 


Leu 


Phe 


Leu 


Pro Met Pro Lys 


Lys 




185 










190 


195 


Ser Met Phe 


Phe His 


Ala 


Lys 


Pro 


Ser 


Arg 


Glu He Lys Lys 


Ser 




200 










205 




210 


Ser Ser Val 


Asn Pro 


Val 


Leu 


Glu 


Glu


Thr 


His Glu Gly Glu 


Ala 




215 










220 


225 


Pro Gly Cys 


Glu Glu 


Gin 


Lys 


Pro 


Thr 


Ser 


Glu He Leu Ser 


Thr 




230 










235 




240 


Ser Gly Lys 


Leu Asn 


Lys 


Gly 


Gin 


Leu 


Asn 


Ser Leu Lys Pro 


Ser 




245 










250 




255 


Asn Val Thr 


Val Asp 


Val 


Phe 


Val 


Gin 


Trp 


Phe Gin Asp Leu 


Lys 




260 










265 




270 


Glu Cys Tyr 


Ser Ser 


Lys 


Arg 


Leu 


Phe 


Tyr 


Trp Ser Leu Trp 


Trp 




275 










280 


285 


Ala Phe Ala 


Thr Ala 


Gly 


Phe 


Asn 


Gin 


Val 


Leu Asn Tyr Val 


Gin 




290 










295 


300 


He Leu Trp 


Asp Tyr 


Lys 


Ala 


Pro 


Ser 


Gin 


Asp Ser Ser He 


Tyr 




305 










310 




315 
Asn


Gly 


Ala 


Val Glu 
320 


Ala 


Ile 


Ala 


Ala 


Phe 


Ala 


Val Gly 
335 


Tyr 


Val 


Lys 


Glu 


Leu 


Ala 


Leu Val 
350 


Val 


Phe 


Ser 


Phe 


Leu 


Met 


His Tyr 
365 


Thr 


Ala 


Asn 


Tyr 


Leu 


Ile


Phe Lys 
380 


Ser 


Ser 


Tyr 


Val 


Phe 


Gin 


Ile Ala 
395 


Val 


Asn 


Leu 


Val 


Phe 


Gly 


Ile Asn 
410 


Thr 


Phe 


He 


Met 


Thr 


Val 


Ile Val 
425 


Val 


Asp 


Gin 


Ser 


He 


Gin 


Phe Leu 
440 


Val 


Tyr 


Gly 


Gly 


He 


Phe 


Leu Met 
455 


Arg 


Ser 


Met 


Ser 


Gin 


Lys 


Asp Val 
470 


Gin 


Ser 


Pro 


Val 


Ser 


His 


Pro Glu 
485 


Glu 


Glu 


Ser 



Leu 
Met


Thr 


Asn 


Ser 


Lys 


Gly 


Arg 


Ser 


1 








5 








Gly 


Pro 


Ser 


Ser 


Gly 
20 


Gly 


Gly 


Phe 


Asn 


Thr 


He 


Gin 


Ser 
35 


Asp 


Lys 


Phe 


Val 


Pro 


Val 


He 


Tyr 
50 


Gin 


Lys 


Asn 


Pro 


Asn 


Gly 


He 


Trp 
65 


Gin 


Asp 


Gly 


Phe 


Ser 


Asn 


Arg 


Ser 


Glu 


Gin 


His 








80 








Glu 


Gin 


Ser 


Phe 


His 
95 


Ala 


Asn 


Asn 


Ser 


Gin 


Lys 


Tyr 


Asp 

110 


Asp 


Tyr 


Ala 


Arg 


Glu 


Thr 


Ser 


Glu 
125 


Thr 


Thr 


Ala 


Ser 


Ser 


Asp 


Gly 


Asp 
140 


Glu 


Asp 


Ala 


Leu 


Pro 


Lys 


Glu 


Ser 
155 


Ser 


Gly 


Ile


Pro 


Phe 


Leu 


Leu 


Ala 
170 


Gly 


Phe 


Gly 


Leu 


Asp 


He 


Val 


Gin 


His 


Trp 


Glu 



185 



Thr Phe 


Gly Gly Ala Val 


Ala 


325 




330 


Val Asn 


Trp Asp Leu Leu 


Gly 


340 




345 


Val Val 


Asn Ala Gly Ser 


Leu 


355 




360 


He Trp 


Ala Cys Tyr Ala 


Gly 


370 




375 


Met Leu 


Leu He Thr He 


Ala 


385 




390 


Asn Val 


Glu Arg Tyr Ala 


Leu 


400 




405 


Ala Leu 


Val Ile Gin Thr 


He 


415 




420 


Arg Gly 


Leu Asn Leu Pro 


Val 


430 




435 


Ser Tyr 


Phe Ala Val He 


Ala 


445 




450 


Tyr Ile


Thr Tyr Ser Thr 


Lys


460 




465 


Ala Pro 


Ser Glu Asn Pro 


Asp 


475 




480 


Asn Ile


Ile Met Ser Thr


Lys 








He Thr 


Asp Lys Thr Ser


Gly 


10 




15 


Val Asp 


Trp Thr Leu Arg 


Leu 


25 




30 


Leu Asn 


Leu Leu Leu Ser 


Met 


40 




45 


Gin Glu 


Asp Arg His Lys 


Lys 


55 




60 


Leu Ser


Thr Ala Val Gin 


Thr 


70 




75 


Met Glu 


Tyr His Ser Phe 


Ser 


85 




90 


Gly His 


Ala Ser Ser Ser 


Cys 


100 




105 


Asn Tyr 


Asn Tyr Cys Asp 


Gly 


115 




120 


Met Leu 


Gin Asp Glu Asp 


He 


130 




135 


He Val 


Glu Val Thr Pro 


Lys 


145 




150 


Met Ala 


Leu Gln Ile Leu


Val 


160 




165 


Thr Val 


Ser Ala Gly Met 


Val 


175 




180 


Val Phe 


Arg Lys Val Thr 


Glu 


190 




195 
Val


Phe 


Ile


Leu 


Val Pro Ala 


Leu Leu Gly Leu Lys Gly Asn 


Leu 










200 




205 


210 


Glu 


Met 


Thr 


Leu 


Ala Ser Arg 


Leu Ser Thr Ala Val Asn He 


Gly 










215 




220 


225 


Lys 


Met 


Asp 


Ser 


Pro He 


Glu 


Lys Trp Asn Leu He He Gly 


Asn 










230 




235 


240 


Leu 


Ala 


Leu 


Lys 


Gin Val 


Gin 


Ala Thr Val Val Gly Phe Leu 


Ala 










245 




250 


255 


Ala 


Val 


Ala 


Ala 


He He 


Leu 


Gly Trp He Pro Glu Gly Lys 


Tyr 










260 




265 


270 


Tyr 


Leu 


Asp 


His 


Ser He 


Leu 


Leu Cys Ser Ser Ser Val Ala 


Thr 










275 




280 


285 


Ala 


Phe 


He 


Ala 


Ser Leu 


Leu 


Gin Gly He He Met Val Gly 


Val 










290 




295 


300 


Ile


Val 


Gly 


Ser 


Lys Lys 


Thr 


Gly Ile Asn Pro Asp Asn Val


Ala 










305 




310 


315 


Thr 


Pro 


Ile


Ala 


Ala Ser


Phe 


Gly Asp Leu He Thr Leu Ala 


He 










320 




325 


330 


Leu 


Ala 


Trp 


He 


Ser Gin Gly 


Leu Tyr Ser Cys Leu Glu Thr 


Tyr 










335 




340 


345 


Tyr 


Tyr 


He 


Ser 


Pro Leu Val 


Gly Val Phe Phe Leu Ala Leu 


Thr 










350 




355 


360 


Pro 


He 


Trp 


He 


He He 


Ala 


Ala Lys His Pro Ala Thr Arg


Thr 










365 




370 


375 


Val 


Leu 


His 


Ser 


Gly Trp Glu 


Pro Val Ile Thr Ala Met Val


He 










380 




385 


390 


Ser 


Ser 


He 


Gly 


Gly Leu He 


Leu Asp Thr Thr Val Ser Asp 


Pro 










395 




400 


405 


Asn 


Leu 


Val 


Gly 


He Val 


Val 


Tyr Thr Pro Val He Asn Gly 


He 










410 




415 


420 


Gly 


Gly 


Asn 


Leu 


Val Ala


He 


Gin Ala Ser Arg He Ser Thr 


Tyr 










425 




430 


435 


Leu 


His 


Leu 


His 


Ser He 


Pro 


Gly Glu Leu Pro Asp Glu Pro


Lys 










440 




445 


450 


Gly 


Cys 


Tyr 


Tyr 


Pro Phe Arg 


Thr Phe Phe Gly Pro Gly Val


Asn 










455 




460 


465 


Asn 


Lys 


Ser 


Ala 


Gin Val Leu 


Leu Leu Leu Val He Pro Gly 


His 










470 




475 


480 


Leu 


Ile


Phe


Leu 


Tyr Thr He 


His Leu Met Lys Ser Gly His 


Thr 










485 




490 


495 


Ser 


Leu 


Thr 


He 


He Phe 


He 


Val Val Tyr Leu Phe Gly Ala 


Val 










500 




505 


510 


Leu 


Gin 


Val 


Phe 


Thr Leu 


Leu 


Trp He Ala Asp Trp Met Val 


His 










515 




520 


525 


His 


Phe 


Trp 


Arg 


Lys Gly Lys 


Asp Pro Asp Ser Phe Ser He 


Pro 










530 




535 


540 


Tyr 


Leu 


Thr 


Ala 


Leu Gly Asp 


Leu Leu Gly Thr Ala Leu Leu 


Ala 










545 




550 


555 


Leu 


Ser 


Phe 


His 


Phe Leu Trp 


Leu He Gly Asp Arg Asp Gly 


Asp 










560 




565 


570 


Val 


Gly 


Asp 
Met


Asp 


Ala 


Val 


Lys


Tyr 


Leu 


1 








5 






Glu 


Leu 


Thr 


Lys 


Tyr 
20 


Leu 


Phe 


Lys 


Thr 


Thr 


Ile


Ser 


Ser 


Phe 








35 






Gly 


Lys 


Lys 


Val 


Ala 
50 


Leu 


Val


Gin 


Asp 


Val 


Phe 


Gin 
65 


Met 


Glu 


Gin 


Pro 


Ile


Pro 


Asn 
80 


Leu 


Ser 


Ala 


Ala 


Asp 


Asp 


Tyr 
95 


Lys 


Ala 


Ile


Leu 


Pro 


Glu 


Asp 

110 


Val 


Leu 


Gly 


Ser 


Cys 


Thr 


Val 

125 


Glu 


Val 


Phe 


Leu 


Ser 


Asp 


Lys 
140 


Thr 


Leu 


Phe 


Asp 


Thr 


Ala 


Pro 
155 


Thr 


Gly 


Pro 


Ser 


Ala 


Trp 


Thr 
170 


Asp 


Tyr 


Ser 


Cys 


Leu 


Gly 


Gin 
185 


Leu 


Ser 


Tyr 


Asn 


Ser 


Ala 


Leu 
200 


Glu 


Lys 


Met 


Met 


Leu 


Val 


Ala 
215 


Arg 


Pro 


Gin 


Arg 


Ala 


Gin 


Gin 
230 


Glu 


Leu 


Lys 


Val


Ile


Ile


Asn 
245 


Asn 


Tyr 


Ser 


Ser 


Gin 


Met 


Lys 
260 


Ser 


Glu 


Thr 


Glu 


Trp 


Leu 


Asn 
275


Asn 


Asn 


Lys 


Gin 


Lys 


Glu 


Glu 
290 


Gly 


He 


Asp 


Asp 


Asn 


Leu 


Ile
305 


Glu 


Asn 


Pro 


Gin 


Phe 


Asn 


Lys 
320 


Leu 


He 


Gin 


Tyr 


Leu 


Phe 


Thr 

335 


Met 


Gly 


Val 


Ala 


Thr 


Gin 


Leu 
350 


Ala 


Thr 


Val 


Leu 


Leu 


Ala 


Thr 
365 


Thr 


Asp 


Thr 


Thr 


Ser 


Asn 


Leu 
380 


Asn 


Thr 


Leu 


Glu 


Lys 


Tyr 


Lys 
395 


Lys 


Glu 


Thr 


Pro 


Gin 


Asp 


Asp 
410 


Ile


Asp 


Pro 


Cys 


Thr 


Glu 


Glu 
425 


Ile


Ala 


Met 


Glu 


Asn 


Gin 


Asp 
440 


Asp 


Met 


Pro 


Thr 


Gly 


His 


Thr 
455 


Leu 


Leu 


His 


Arg 


Glu 


Leu 


Lys 


Lys 


Lys 



Asn 


Lys 


Leu 
10 


Asn 


Leu 


Asp 


Asn 


He 
15 


Phe 


Thr 


Gly 
25 


Lys 


Gly 


Gly 


Val 


Gly 
30 


He 


Ala 


Leu 
40 


Asn 


Leu 


Ala 


Glu 


Asn 
45 


Ser 


Thr 


Asp 
55 


Pro 


Ala 


Ser 


Asn 


Leu 
60 


Leu 


Ser 


Asn 
70 


Lys 


Leu 


Thr 


Lys 


Tyr 
75 


He 


Ala 


Asn 
85 


Phe 


Asp 


Pro 


He 


Val 
90 


Gin 


Ser 


He 
100 


Glu 


Pro 


Tyr 


Glu 


Gly 
105 


Ala 


Glu 


Met 
115 


Lys 


Glu 


Gin 


Leu 


Ser 
120 


Ala 


Ala 


Phe 
130 


Asn 


Glu 


Phe 


Thr 


Asn 
135 


Glu 


Gin 


Glu 
145 


Phe 


Asp 


Phe 


He 


He 

150 


His 


Thr 


Leu 
160 


Arg 


Met 


Leu 


Glu 


Leu 
165 


Leu 


Asn 


Thr 
175 


Thr 


Ser 


Asn 


Asp 


Ala 
180 


Gly 


Leu 


Asn 
190 


Glu 


Asn 


Arg 


Val 


Lys 
195 


Leu 


Arg 


Asn 
205 


Gin 


Asp 


Asp 


Thr 


Thr 
210 


Thr 


His 


Ser 
220 


Ser 


He 


Tyr 


Glu 


He 
225 


Gin 


Gin 


Leu 

235 


Ser 


He 


Ser 


Lys 


Phe 
240 


He 


Glu 


Glu 

250 


Ser


His 


Gly 


Leu 


He 
255 


Gin 


Asp 


Lys 
265 


Asn 


He 


Asn 


His 


Phe 
270 


His 


Ala 


Tyr 
280 


Tyr 


Val 


Pro 


Tyr 


Lys 
285 


Glu 


Asn 


Leu 
295 


Thr 


Asn 


Leu 


Leu 


Asn 
300 


Asp 


Asp 


Phe 
310 


He 


Val 


Glu 


Asp 


His 
315 


Asp 


Glu 


He 

325 


Glu 


Asn 


Ser 


Lys 


Val 

330 


Lys 


Gly 


Gly 
340 


Val 


Gly 


Lys 


Thr 


Thr 
345 


Ala 


Leu 


Ser 

355 


Asn 


Lys 


Gly 


Tyr 


Arg 

360 


Pro 


Thr 


Lys 
370 


Glu 


He 


Asn 


Val 


Glu 
375 


Ala 


Tyr 


He 
385 


Asp 


Glu 


Glu 


Gin 


Ala 
390 


Val 


Leu 


Ala 
400 


Thr 


Val 


Asn 


Asp 


Asp 
405 


Tyr 


He 


Met 


Glu 


Asp 


Leu 


Lys 


Ser 




415 










420 


Phe 


Phe 


Lys 
430 


Ala 


Phe 


Ser 


Asp 


He 
435 


Asp 


Tyr 


Val 
445 


He 


Val 


Asp 


Thr 


Ala 
450 


Leu 


Leu 


Asp 
460 


Ser 


Ser 


Glu 


Asn 


His 
465 


Ser 


Thr 


Gin 


Thr 


Thr 


Ser 


Asn 


Val
470








Glu 


Thr 


Leu 


Leu 


Pro 
485 


Lys 


He 


Gin 


He 


Ile


Val 


Thr 


Leu 
500 


Ala 


Glu 


Lys 


Arg 


Leu 


Val 


Glu 


Asp 

515 


Leu 


Asn 


Arg 


Trp 


Val 


Val 


Asn 


Gin 
530 


Ser 


Leu 


Val 


Leu 


Phe 


Ser 


Asn 


Lys 
545 


Lys 


Glu 


Asp 


He 


Lys 


Asn 


Glu 


Ser 
560 


Phe 


Asp 


Asn 


Gly 


Leu 


Ser 
Met


Asn 


Ser 


Asn Leu 


Asn Leu Asp 


1 






5 




Val 


Met 


Asn 


Ala He 


Ser Ser Leu 








20 




Phe 


Leu 


Glu 


Tyr Phe 


Ser Thr Cys 








35 




Gly 


Ser 


Phe 


Leu Ser 


Thr Cys He 








50 




Ala 


Leu 


Ser 


Val Met 


He Ala Gly 








65 




His 


Phe 


Pro 


Ala Val 


Glu Gin Pro 








80 




Val


Ser 


Ser 


Met Pro 


Cys Phe Tyr 








95 




Leu 


Gly 


Val 


Ala Glu 


Thr Leu Val 
Ser


Tyr 


Arg 


Phe Val 


Pro Ser Asn 








125 




Phe 


Leu 


Thr 


Leu Phe 


Asn Gly Phe 








140 




Leu 


Val 


Lys 


Leu Val 


Tyr Leu He 








155 




Trp 


Phe 


Pro 


Asn Thr 


Leu Asn Lys 








170 




Phe


Phe 


Leu 


Ala Ser 


Leu Thr Leu 








185 




Ser 


Val 


Ser 


Gin Arg 


Tyr Cys Asn 








200 




Asn 


He 


Arg 


Gly Ser 


Asn Leu Glu 








215 




Lys 


Ser 


Leu 


Lys Phe 


Tyr Gly Ser 








230 




Ile


Asp 


Leu 


Trp Glu 


Thr Ala Leu 








245 
475




480 


Asn Lys 


Asn Leu 


Thr Gin Met 




490 




495 


Thr 


Pro 


Tyr Leu 


Glu Ser Lys 




505 




510 


Ala 


Asn 


Ile Gly


His Asn Trp 




520 




525 


Thr 


Leu 


Asn Gln


Arg Asp Asp 




535 




540 


Glu 


Ser 


Phe Trp 


Ile Asn Lys




550 




555 


Tyr 


Phe 


Val Ile


Pro Tyr Gly 




565 




570 



Gly Phe 


Leu Leu 


Pro Ile Ala


10 




15 


Pro Leu 


Leu Ile


Leu Ala Pro 


25 




30 


Leu Phe 


Pro Ser 


Lys Arg Val 


40 




45 


Ile Ala


Gly Asn 


Leu Phe Ala 


55 




60 


Phe Phe 


Glu Ile


His Arg Lys 


70 




75 


Leu Ser 


Gly Lys 


Val Leu Thr 


85 




90 


Leu Ile


Leu Gln


Tyr Val Leu


100 




105 


Asn Pro 


Ala Leu 


Ser Val Ile


115 




120 


Val Arg 


Gly Thr 


Ser Met Asn 


130 




135 


Gly Cys 


Phe Thr 


Gly Ala Leu 


145 




150 


Ser Glu 


Gly Lys 


Asn Arg Gln


160 




165 


Gly Asn 


Leu Glu 


Gly Phe Phe 


175 




180 


Leu Asn 


Val Leu 


Gly Phe Cys 


190 




195 


Leu Asn 


His Phe 


Asn Ala Gln


205 




210 


Glu Thr 


Leu Leu 


Leu His Glu 


220 




225 


Ile Gln


Glu Phe 


Ser Ser Ser 


235 




240 
<212> PRT

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 5508353CD1

<400> 21 



Met 


Lys 


Pro 


Ala 


Ser 


Pro 


Val


Glu 


Glu 


Glu 


Val 


Ser 


Gln


Val 


Cys 


1 








5 










10 










15 


Glu 


Ser 


Pro 


Gln


Cys 


Ser 


Ser 


Ser 


Ser 


Ala 


Cys 


Cys 


Thr 


Glu 


Thr 










20 










25 










30 


Glu 


Lys 


Gln


His 


Gly 


Asp 


Ala 


Gly 


Leu 


Leu 


Asn 


Gly 


Lys 


Ala 


Glu 










35 










40 










45 


Ser 


Leu 


Pro 


Gly 


Gln


Pro 


Leu 


Ala 


Cys 


Asn 


Leu 


Cys 


Tyr 


Glu 


Ala 










50 










55 










60 


Glu 


Ser 


Pro 


Asp 


Glu 


Ala 


Ala 


Leu 


Val 


Tyr 


Ala 


Ala 


Arg 


Ala 


Tyr 










65 










70 










75 


Gln


Cys 


Thr 


Leu 


Arg 


Ser 


Arg 


Thr 


Pro 


Glu 


Gln


Val 


Met 


Val 


Asp 










80 










85 










90 


Phe 


Ala 


Ala 


Leu 


Gly 


Pro 


Leu 


Thr 


Phe 


Gln


Leu 


Leu 


His 


Ile


Leu 










95 










100 










105 


Pro 


Phe 


Asp 


Ser 


Val 


Arg 


Lys


Arg 


Met 


Ser 


Val 


Val 


Val 


Arg 


His 










110 










115 










120 


Pro 


Leu 


Ser 


Asn 


Gln


Val


Val 


Val 


Tyr 


Thr 


Lys 


Gly 


Ala 


Asp 


Ser 










125 










130 










135 


Val 


Ile


Met 


Glu 


Leu 


Leu 


Ser 


Val 


Ala 


Ser 


Pro 


Asp 


Gly 


Ala 


Ser 










140 










145 










150 


Leu 


Glu 


Lys 


Gln


Gln


Met 


Ile


Val 


Arg 


Glu 


Lys 


Thr 


Gln


Lys 


His 










155 










160 










165 


Leu 


Asp 


Asp 


Tyr 


Ala 


Lys 


Gln


Gly 


Leu 


Arg 


Thr 


Leu 


Cys 


Ile


Ala 










170 










175 










180 


Lys 


Lys 


Val 


Met


Ser 


Asp 


Thr 


Glu 


Tyr 


Ala 


Glu 


Trp 


Leu 


Arg 


Asn 










185 










190 










195 


His


Phe 


Leu 


Ala 


Glu 


Thr 


Ser 


Ile


Asp 


Asn 


Arg 


Glu 


Glu 


Leu 


Leu 










200 










205 








210 


Leu 


Glu 


Ser 


Ala 


Met 


Arg 


Leu 


Glu 


Asn 


Lys 


Leu 


Thr 


Leu 


Leu 


Gly 










215 










220 










225 


Ala 


Thr 


Gly 


Ile


Glu 


Asp 


Arg 


Leu 


Gln


Glu 


Gly 


Val


Pro 


Glu 


Ser 










230 










235 










240 


Ile


Glu 


Ala 


Leu 


His 


Lys 


Ala 


Gly 


Ile


Lys 


Ile


Trp 


Met 


Leu 


Thr 










245 










250 










255 


Gly 


Asp 


Lys 


Gln


Glu 


Thr 


Ala 


Val 


Asn 


Ile


Ala 


Tyr 


Ala 


Cys 


Lys 










260 










265 










270 


Leu 


Leu 


Glu 


Pro 


Asp 


Asp 


Lys 


Leu 


Phe 


Ile


Leu 


Asn 


Thr 


Gln


Ser 










275 










280 










285 


Lys 


Asp 


Ala 


Cys 


Gly 


Met 


Leu 


Met 


Ser 


Thr 


Ile


Leu 


Lys 


Glu 


Leu 










290 










295 










300 


Gln


Lys 


Lys 


Thr 


Gln


Ala 


Leu 


Pro 


Glu 


Gln


Val 


Ser 


Leu 


Ser 


Glu 










305 










310 










315 


Asp 


Leu 


Leu 


Gln


Pro 


Pro 


Val 


Pro 


Arg 


Asp 


Ser 


Gly 


Leu 


Arg 


Ala 










320 










325 










330 


Gly 


Leu 


Ile


Ile


Thr 


Gly 


Lys 


Thr 


Leu 


Glu 


Phe 


Ala 


Leu 


Gln


Glu 










335 










340 










345 


Ser 


Leu 


Gln


Lys 


Gln


Phe 


Leu 


Glu 


Leu 


Thr 


Ser 


Trp 


Gly 


Gln


Ala 










350 










355 






360 


Val


Val 


Cys 


Cys 


Arg 


Ala 


Thr 


Pro 


Leu 


Gln


Lys 


Ser 


Glu 


Val 


Val 










365 










370 








375 


Lys 


Leu 


Val 


Arg 


Ser 


His 


Leu 


Gln


Val 


Met 


Thr 


Leu 


Ala 


Ile


Gly 










380 










385 










390 


Asp 


Gly 


Ala 


Asn 


Asp 


Val 


Ser 


Met 


Ile


Gln


Val 


Ala 


Asp 


Ile


Gly 










395 










400 








405 


Ile


Gly 


Val


Ser 


Gly 


Gln


Glu 


Gly 


Met 


Gln


Ala 


Val 


Met 


Ala 


Ser 
410



Asp 


Phe 


Ala Val 


Ser 
425 


Gln


Phe 


Lys 


Val


His 


Gly His 


Trp 
440 


Cys 


Tyr 


Thr 


Tyr 


Phe 


Phe Tyr 


Lys
455 


Asn 


Val 


Ala 


Tyr 


Gln


Phe Phe 


Cys 
470 


Gly 


Phe 


Ser 


Trp 


Val 


Leu Ile


Phe 
485 


Phe 


Asn 


Leu 


Val 


Ile


Tyr Gly 


Val 
500 


Leu 


Glu 


Lys 


Met 


Gln


Leu Pro 


Glu 
515 


Leu 


Tyr 


Lys 


Tyr 


Leu 


Pro His 


Thr 
530 


Phe 


Trp 


Ile


Gln


Ser 


Leu Val 


Cys 
545 


Phe 


Phe 


Val


Ser 


Asp 


Thr Asp 


Ile
560 


Phe 


Ala 


Phe 


Ala 


Leu 


Phe Ile


Val 
575 


Leu 


Leu 


His 


Leu 


Thr 


Trp Ile


His 
590 


Leu 


Leu 


Val 


Tyr 


Phe 


Leu Phe 


Ala 
605 


Ile


Val 


Phe 


Asn 


Pro 


Pro Ser 


Asn 

620 


Pro 


Tyr 


Trp 


Asp 


Pro 


Val Phe 


Tyr 
635 


Leu 


Val


Cys


Leu 


Leu 


Pro Arg 


Phe 
650 


Val 


Tyr 


Arg 


Pro 


Ser 


Pro Ile


Leu 
665 


Arg 


Ala 


Lys 


Glu 


Glu 


Arg Thr 


Lys 
680 


Ala 


Leu 


Lys 


Met 


Asn 


Gln Val


Thr 
695 


Ser 


Lys 


Tyr 


Ser 


Gly


Arg Arg 


Pro 
710 


Met 


Pro 


Gly 


Lys 


Ser 


Ala Ser 


Ser 
725 


Cys 


Ala 


Ile


Cys 


Glu 


Thr Ala 


Leu 
740 


Asp 


Gln


Gly 


Glu 


Met 


Ala Gly 


Pro 


Ser 


Lys 


Gly 



755 

<210> 22 

<211> 219 

<212> PRT 

<213> Homo sapiens

<220> 

<221> misc_feature

<223> Incyte ID No: 8543628CD1 

<400> 22 

Met Ala Ser Ser Gly Leu Glu Leu 

1 5 
Leu Gly Lys Gly Ala Ala Gln Thr
20 

Ile Gly Phe Ser Thr Val Gly Gly



415 








420 


His Leu 


Ser 


Lys Leu 


Leu 


Leu 


430 








435 


Arg Leu 


Ser 


Asn Met 


Ile


Leu 


445 








450 


Tyr Val 


Asn 


Leu Leu 


Phe 


Trp 


460 








465 


Gly Thr 


Ser 


Met Thr 


Asp 


Tyr 


475 








480 


Leu Phe 


Thr 


Ser Ala 


Pro 


Pro 


490 








495 


Asp Val 


Ser 


Ala Glu 


Thr 


Leu 


505 








510 


Ser Gly 


Gln


Lys Ser 


Glu 


Ala 


520 








525 


Thr Leu 


Leu 


Asp Ala 


Phe 


Tyr


535 








540 


Pro Tyr 


Phe 


Thr Tyr 


Gln


Gly 


550 








555 


Gly Asn 


Pro 


Leu Asn


Thr 


Ala 


565 








570 


Leu Val 


Ile


Glu Ser 


Lys


Ser 


580 








585 


Ile Ile


Gly 


Ser Ile


Leu 


Ser 


595 








600 


Gly Ala 


Met 


Cys Val 


Thr 


Cys 


610 








615 


Ile Met


Gln


Glu His 


Met 


Leu 


625 








630 


Ile Leu


Thr 


Thr Ser 


Ile


Ala 


640 








645 


Val Leu 


Gln


Gly Ser 


Leu 


Phe 


655 








660 


His Phe 


Asp 


Arg Leu 


Thr 


Pro 


670 








675 


Lys Trp 


Arg


Gly Ala


Gly


Lys 


685 








690 


Ala Asn 


Gln


Ser Ala 


Gly


Lys 


700 








705 


Pro Ser


Ala 


Val Phe 


Ala 


Met 


715 








720 


Glu Gln


Gly 


Asn Leu 


Ser 


Leu 


730 








735 


Tyr Ser 


Glu 


Thr Lys 


Ala 


Phe 


745 








750 


Lys Glu 


Ser 








760 










Leu Trp 


Val 


Ser Leu 


Pro 


Gln


10 








15 


Leu Ser 


Ile


Ser Phe 


Leu 


Ser 


25 








30 


Val Leu 


Tyr 


Gly Val 


Leu 


Arg 
35



Thr 


Leu 


Asn 


Asn Lys 
50 


Ala 


Ile


Asn


Glu 


Leu 


Phe 


Arg Ala 
65 


Ile


Pro 


Val 


Phe 


Phe 


Gly 


Val Pro 
80 


Ile


Phe 


Phe 


Trp 


Cys 


Ala 


Val Leu 
95 


Val 


Leu 


Ser 


Gly 


Glu 


Val 


Val Arg 
110 


Gly 


Ala 


Leu 


Arg 


Glu 


Ala 


Gly Leu 
125 


Ser 


Ile


Gly 


Gly 


Tyr 


Val 


Leu Leu 
140 


Pro 


Gln


Ala 


Thr 


Ile


Asn


Val Tyr 

155 


Thr 


Arg 


Ile


Val 


Leu 


Ile


Gly Val 
170 


Val 


Asp 


Val 


Ile


Glu 


Arg 


Thr Tyr 
185 


Glu 


Ser 


Val 


Leu 


Phe 


Phe 


Phe Phe 
200 


Ile


Cys 


Tyr 


Leu 


Leu 


Glu 


Arg Arg 


Trp 


Ala 


Gln
<210> 23
<211> 463 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7482754CD1 

<400> 23 



Met 


Glu 


Gly 


Gln


Thr 


Pro 


Gly Ser 


1 








5 






His 


Pro 


Ala 


Thr 


Ala 
20 


Ala 


Ala Thr 


Phe 


Ile


Leu 


Met 


Lys 
35 


Ser 


Ala Leu 


Pro 


Trp 


Ala 


Phe 


Ser 
50 


Lys


Ala Gly 


Val


Glu 


Leu 


Val 


Ser 
65 


Leu 


Val Phe 


Leu 


Gly 


Tyr 


Ala 


Ala 
80 


Ala 


Val Ser


Val


Val 


Arg 


Gly 


Leu 

95 


Cys 


Gly Pro 


Ala 


Cys 


Phe 


Leu 


Leu 
110 


Asn 


Leu Leu 


Arg 


Val 


Ile


Gly 


Asp 
125 


Gln


Leu Glu 


Ser


Gly 


Thr 


Pro 


Pro 
140 


Ala 


Pro Gln


Phe 


Thr 


Leu 


Pro 


Leu 
155 


Leu 


Ser Val 


Ala 


Pro 


Arg 


Glu 


Ile
170 


Ala 


Phe Gln


Gly 


His 


Cys 


Val 


Ser
185 


Ile


Leu Gly 


Ala 


Leu 


Val 


Ile


Thr 


Val 


Gln Tyr





40 






45 


Gly 


Val 


Leu Arg Val 


Tyr 


Leu 




55 






60 


Leu 


Val 


Trp Leu Tyr 


Leu 


Leu 




70 






75 


Gly 


Leu 


Ser Ile Pro


Ser 


Phe 




85 






90 


Leu 


Trp 


Gly Ala Ser 


Glu 


Val 




100 






105 


His 


Ser 


Leu Pro Arg 


Gly 


Gln




115 






120 


Leu 


Ser 


Asp Leu Gln


Leu 


Tyr 




130 






135 


Leu 


Arg 


Arg Met Thr 


Pro 


Pro 




145 






150 


Ile


Lys 


Thr Ser Ser 


Leu 


Ala 




160 






165 


Ile


Lys 


Val Gly Gln


Gln


Ile




175 






180 


Leu 


Ile


Tyr Gly Ala 


Leu 


Phe 




190 






195 


Pro 


Leu 


Ser Ala Ala 


Ser 


Lys 




205 






210 


Ala 










Arg 


Gly 


Leu Pro Glu 


Lys 


Pro 




10 






15 


Leu 


Ser 


Ser Met Gly 


Ala 


Val 




25 






30 


Gly 


Ala 


Gly Leu Leu 


Asn 


Phe 




40 






45 


Gly 


Val 


Val Pro Ala 


Phe 


Leu 




55 






60 


Leu 


Ile


Ser Gly Leu 


Val 


Ile




70 






75 


Gly 


Gln


Ala Thr Tyr 


Gln


Gly 




85 






90 


Ala 


Ile


Gly Lys Leu 


Cys 


Glu 




100 






105 


Met 


Ile


Ser Val Ala 


Phe 


Leu 




115 






120 


Lys 


Leu 


Cys Asp Ser 


Leu 


Leu 




130 






135


Pro 


Trp 


Tyr Ala Asp 


Gln


Arg 




145 






150


Leu 


Val


Ile Leu Pro


Leu 


Ser 




160 






165 


Lys 


Tyr 


Thr Ser Pro 


Ser 


His 




175 






180 


Thr 


Leu 


Ala Ala Cys 


Tyr


Leu 




190 






195 


Tyr 


Leu 


Trp Pro Gln


Gly 


Leu 
200








Val 


Arg 


Glu Ser 


His 
215 


Pro 


Ser 


Leu 


Val 


Phe 


Ser Val 


Phe 
230 


Pro 


Thr 


Ile


Glu 


Ala 


Ala Val 


Ser 
245 


Ile


Tyr 


Cys 


Ser


His 


Trp Ala 


Leu 
260 


Val 


Ser 


Val 


Leu 


Ile


Tyr Ser 


Leu 
275 


Thr 


Gly 


Val 


Thr 


Glu 


Val Ser 


Ala 
290 


Asp 


Val 


Leu 


Met 


Val 


Ile Ile


Val 
305 


Ala 


Arg 


Val 


Thr 


Val 


Tyr Pro 


Ile
320 


Val 


Leu 


Phe 


Asp 


Phe 


Trp Arg 


Arg 
335 


Ser 


Cys 


Leu 


Leu 


Ala 


Asp Pro 


Ser 
350 


Gly 


Leu 


Trp 


Leu 


Trp 


Val Thr 


Val 

365 


Thr 


Leu 


Ala 


Leu 


Ser 


Glu Ile


Val 
380 


Ser 


Ile


Ile


Ile


Phe 


Ile Phe


Pro 
395 


Gly 


Pro 


Gly 


Gly 


Trp 


Phe Leu 


Arg 
410 


Lys 


Pro 


Ser 


Glu 


Lys 


Gly Lys 


Pro 
425 


Ala 


Arg 


Phe 


Cys 


Arg 


Ala Tyr 


Arg 
440 


Thr 


Lys 


Ser 


Thr 


Gly 


Arg Glu 


Ser 
455 


Leu 


Ser 


Thr 



<210> 24 

<211> 1043 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 3794818CD1 

<400> 24 



Met 


Glu 


Phe 


Val 


Arg 


Ala 


Leu 


Trp 


1 








5 








Gly 


Pro 


Gly 


Ser 


Ala 
20 


Gly 


Gly 


His 


Ala 


Arg 


Leu 


Gly 


Gly 
35 


Ser 


Val 


Arg 


Ala 


Pro 


Leu 


Ala 


Arg 
50 


Ala 


Arg 


Ala 


Ala 


Leu 


Ala 


Pro 


Arg 
65 


Leu 


Pro 


His 


Val 


Ala 


Ala 


Pro 


Pro 
80 


Ala 


Arg 


Asp 


Leu 


Cys 


Gln


Ala 


Leu 
95 


Val 


Pro 


Pro 


Phe 


Pro 


Glu 


Ala 


Arg 
110 


Pro 


Glu 


Leu 


Ala 


Ala 


Thr 


Glu 


Thr 


Pro 


Val 


Leu 



205 




210 


Ser Pro 


Ala Ser Trp Thr


Ser 


220 




225 


Cys Phe 


Gly Phe Gln Cys


His 


235 




240 


Ser Met 


Arg Lys Arg Ser 


Leu 


250 




255 


Leu Ser 


Leu Leu Ala Cys 


Cys 


265 




270 


Tyr Gly 


Phe Leu Thr Phe 


Gly 


280 




285 


Met Ser 


Tyr Pro Gly Asn 


Asp 


295 




300 


Leu Phe 


Ala Val Ser Ile


Val 


310 




315 


Leu Gly 


Arg Ser Val Met 


Gln


325 




330 


Gly Gly 


Trp Gly Pro Ser 


Ala 


340 




345 


Val Arg 


Met Pro Leu Thr 


Ile


355 




360 


Met Ala 


Leu Phe Met Pro 


Asp 


370 




375 


Gly Gly 


Ile Ser Ser Phe


Phe 


385 




390 


Gly Lys 


His Arg Phe Cys 


Glu 


400 




405 


Gln Thr


Ala Gln Ala Leu


Ser 


415 




420 


Val Pro 


His Leu Cys Asn 


Gly 


430 




435 


Gln Val


Arg Asn Leu Arg 


Ala 


445 




450 


Gln Tyr


Glu Gly Ile




460 






Leu Gly 


Leu Ala Leu Ala 


Leu 


10 




15 


Pro Gln


Pro Cys Gly Val 


Leu 


25 




30 


Leu Gly 


Ala Leu Leu Pro 


Arg 


40 




45 


Arg Ala 


Ala Leu Ala Arg 


Ala 


55 




60 


Asn Leu 


Ser Leu Glu Leu 


Val


70 




75 


Pro Ala 


Ser Leu Thr Arg 


Gly 


85 




90 


Gly Val 


Ala Ala Leu Leu 


Ala 


100 




105 


Leu Gln


Leu His Phe Leu 


Ala 


115 




120 


Ser Leu 


Leu Arg Arg Glu 


Ala 
125


Arg Ala 


Pro 


Leu 


Gly Ala Pro 








140 


Trp Ala 


Ser 


Pro 


Leu Glu Thr 








155 


Leu Gln


Ala 


His 


Ala Trp Glu 








170 


Thr Gln


Asp 


Pro 


Gly Gly Leu 








185 


Gly Arg 


Pro 


Pro 


Gln Leu Val








200 


Gly Asp 


Ala 


Gly 


Leu Arg Ala 








215 


Val Gly 


Gly


Glu 


Ala Pro Val 








230 


Asp Ile


Ala 


Arg


Ala Arg Arg 








245 


Pro His 


Trp


Leu 


Leu Gly Thr 








260 


Thr Ala 


Gly


Leu 


Pro Pro Gly 








275 


Arg Pro 


Pro 


Leu 


Glu Ala Ala 






290 


Ala Arg


Ala 


Leu 


Gly Ser Ala 








305 


Leu Leu


Pro 


Ala 


Pro Val Asn 








320 


Pro Glu 


Ser 


Pro 


Gly Arg Phe 








335 


Ser Phe


Gln


Gly


Arg Thr Gly 








350 


Gln Val


His 


Met 


Ser Arg His








365 


Asp Pro 


Arg 


Gly


Ala Pro Ala 








380 


Asp Gly


Gln


Leu 


Asp Leu Glu 








395 


Pro Pro 


Pro 


Gln


Gly Ala Gln








410 


Thr Leu 


Leu 


Glu 


His Pro Phe 








425 


Asp Gly 


Gln


Cys


Pro Ala Gly 








440 


Asn Asp 


Ser 


Ala 


Thr Leu Asp 








455 


Gly Ser 


Ala 


Pro 


Arg Ala Leu 








470 


Ile Asp


Leu 


Leu 


Glu Arg Leu 






485 


Glu Leu 


Tyr 


Leu 


Val Gly Asp 








500 


Gly Arg 


Trp 


Thr 


Gly Leu Val 








515 


His Met 


Ala 


Val 


Thr Ser Phe 








530 


Val Val 


Asp 


Phe 


Thr Ser Pro 






545 


Met Val 


Arg 


Ala 


Arg Asp Thr 








560 


Trp Pro 


Leu 


His 


Trp Ser Thr 








575 


His Leu 


Thr 


Ala 


Leu Phe Leu 








590 



130 135 
Asn Pro Phe His Leu Gln Leu His

145 150 
Leu Leu Asp Val Leu Val Ala Val 

160 165 
Asp Val Gly Leu Ala Leu Cys Arg 

175 180 
Val Ala Leu Trp Thr Ser Arg Ala 

190 195 
Leu Asp Leu Ser Arg Arg Asp Thr

205 210 
Arg Leu Ala Pro Met Ala Ala Pro 

220 225 
Pro Ala Ala Val Leu Leu Gly Cys 

235 240 
Val Leu Glu Ala Val Pro Pro Gly

250 255 
Pro Leu Pro Pro Lys Ala Leu Pro 

265 270 
Leu Leu Ala Leu Gly Glu Val Ala

280 285 
Ile His Asp Ile Val Gln Leu Val

295 300 
Ala Gln Val Gln Pro Lys Arg Ala

310 315 
Cys Gly Asp Leu Gln Pro Ala Gly

325 330 
Leu Ala Arg Phe Leu Ala Asn Thr 

340 345 
Pro Val Trp Val Thr Gly Ser Ser 

355 360 
Phe Lys Val Trp Ser Leu Arg Arg 

370 375 
Trp Ala Thr Val Gly Ser Trp Arg 

385 390 
Pro Gly Gly Ala Ser Ala Arg Pro 

400 405 
Val Trp Pro Lys Leu Arg Val Val

415 420 
Val Phe Ala Arg Asp Pro Asp Glu 

430 435 
Gln Leu Cys Leu Asp Pro Gly Thr

445 450 
Ala Leu Phe Ala Ala Leu Ala Asn

460 465 
Arg Lys Cys Cys Tyr Gly Tyr Cys 

475 480 
Ala Glu Asp Thr Pro Phe Asp Phe 

490 495 
Gly Lys Tyr Gly Ala Leu Arg Asp 

505 510 
Gly Asp Leu Leu Ala Gly Arg Ala 

520 525 
Ser Ile Asn Ser Ala Arg Ser Gln

535 540 
Phe Phe Ser Thr Ser Leu Gly Ile

550 555 
Ala Ser Pro Ile Gly Ala Phe Met

565 570 
Trp Leu Gly Val Phe Ala Ala Leu 

580 585 
Thr Val Tyr Glu Trp Arg Ser Pro 

595 600 
Arg
625


Ile


Leu Phe Arg 
Ser
Gly Arg Leu
Ala
Ala
Lys Thr Phe
Glu Leu
Ile
680 


His 


Asp 
Lys


Leu 
685 


His 


His Pro Ala 
Gly Phe


Arg 
Gly
Val


Trp 


Glu 


Ser 
700 


Ser 


Ala Glu Ala 


Tyr 
Ile Lys


Lys


Ser 


Phe 
Met


His 


Ala 

715 
Met Arg Arg


His 
720 


Ser Ala 
Thr


Thr 
725 


Pro 
Val


Ala 
730 


Met 


Leu Thr Ser 


Asp 

735 


Pro Pro 


Lys 


Leu 


Asn 
740 


Ala 


Phe 


Ile


Met


Asp 
Lys


Ser Leu Leu 


Asp 
750 


Tyr Glu 


Val 


Ser 


Ile
Asp


Ala 


Asp 


Cys 


Lys 
760 
Leu Thr Val


Gly 
Lys Pro


Phe 


Ala 
770
Gly


Tyr 


Gly 


Ile
775 
Leu Pro Gln


Asn 
780 


Ser Pro 


Leu 


Thr 


Ser 


Asn 


Leu 


Ser 


Glu 


Phe 


Ile


Ser Arg Tyr 


Lys 
795


Ser Ser 


Gly 


Phe 


Ile
800 


Asp 


Leu 


Leu 


His 


Asp 
805 


Lys 


Trp Tyr Lys 


Met 
810 


Val Pro 


Cys 


Gly 


Lys
815 


Arg 


Val 


Phe 


Ala 


Val 
820 


Thr 


Glu Thr Leu 


Gln
825 


Met Ser 


Ile


Tyr 


His 


Phe 


Ala 


Gly 


Leu 


Phe 


Val 


Leu Leu Cys 


Leu 








830 










835 




840 


Gly Leu 


Gly 


Ser 


Ala 
845 


Leu 


Leu 


Ser 


Ser 


Leu 
850 


Gly 


Glu His Ala 


Phe 

855 


Phe Arg 


Leu 


Ala 


Leu 
860 


Pro 


Arg


Ile


Arg


Lys
865 


Gly 


Ser Arg Leu 


Gln
870 


Tyr Trp 


Leu 


His 


Thr 
875 


Ser 


Gln


Lys 


Ile


His 
880 


Arg 


Ala Leu Asn 


Thr 
885 


Glu Pro 


Pro 


Glu 


Gly 
890 


Ser 


Lys 


Glu 


Glu 


Thr 
895 


Ala 


Glu Ala Glu 


Pro 
900 


Ser Gly 


Pro 


Glu 


Val 
905 


Glu 


Gln


Gln


Gln


Gln
910 


Gln


Gln Asp Gln


Pro 
915 


Thr Ala 


Pro 


Glu 


Gly 
920 


Trp 


Lys 


Arg 


Ala 


Arg 
925 


Arg 


Ala Val Asp 


Lys 
930 


Glu Arg 


Arg 


Val 


Arg 
935 


Phe 


Leu 


Leu 


Glu 


Pro 
940 


Ala 


Val Val Val 


Ala 
945 


Pro Glu 


Ala 


Asp 


Ala 
950 


Glu 


Ala 


Glu 


Ala 


Ala 
955 


Pro 


Arg Glu Gly 


Pro 
960 


Val Trp 


Leu 


Cys 


Ser 
965 


Tyr 


Gly 


Arg 


Pro 


Pro 
970 


Ala 


Ala Arg Pro 


Thr 
975 


Gly Ala 


Pro 


Gln


Pro 
980 


Gly 


Glu 


Leu 


Gln


Glu 
985 


Leu 


Glu Arg Arg 


Ile
990 


Glu Val


Ala 


Arg 


Glu 


Arg 


Leu 


Arg 


Gln


Ala 


Leu 


Val Arg Arg 


Gly 








995 








1000 




1005 


Gln Leu


Leu 


Ala 


Gln


Leu 


Gly 


Asp 


Ser Ala 


Arg 


His Arg Pro Arg 






1010 








1015 




1020 


Arg Leu 


Leu 


Gln Ala
1025 


Arg 


Ala 


Ala 


Pro Ala 
1030 


Glu 


Ala Pro Pro His 
1035 


Ser Gly 


Arg 


Pro Gly 
1040 


Ser 


Gln


Glu 
10 15




LiGU 


IrXie 




\9XU 

20 


Leu 


TV OT\ 


oex 


Han 


T.vpo rSl v AT"rt Vi* 1 Hc-n 
l4]fsf f\a^ VXjf X\xy VaX nsa^ 

25 30 
Arg
35 


Gln


Gly
Ala


ATy licU \^±.Y wiJ-j *^Xy A.SII 
Pro


laXy 
Gln


Gln


Gly


Ile


oCi oex vji.li vsxy Asp ivxa 
Pro


Asp 


Gly


Gly
65 


Leu 
Glu


Glu Phe Ser Arg Tyr Leu 
Glu


Gln
80 


Arg 


Leu 


Leu 


Leu 


neu xrne xixs oer xieu abp 
Asn


Gln


Asp 


Gly 
95 


His


Ile
Val


£>er Gxu xxe Gxn Gxn £>er 
100 105 


Phe


Arg 


Ala 


Leu 


Gly 
110 


Ile


Ser 


Ile


Ser 


Xjeu GXU Gxn Axa laxu xtyB 
115 120 
Ser


Met
125 


Asp 


Arg 


Asp 


Gly


Tnr neu inr xxe Asp Trp 
130 135 


Gln


Glu


Trp 


Arg 


Asp 
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XT A rm 
Phe


Leu 
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IT'S rni Oj&v* T ^^im mv A I 

HIS 2>er Xieu gxu Asn vax 
Glu
Val


Leu 


Tyr 
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Phe 


Trp 


Lys 


HXS 


£>er rnr vax Leu ASp xxe 
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Cys 
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Thr 
1 /U 


Val
Asp


Glu


pne £>er Lys Gxn gxu i^ys 
Thr


Gly


neu 


Trp 
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Trp 


Lys 


i^xn 


Xieu 


■\7ii 1 Ala ^1 \* Al A t7£i1 &1a 
Vax AXcl uXy VciX HXSL 

190 195 


VI ly 


/LLo 


vax 


oer 


Arg 
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Gxy 
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IVl a 

AX a. 


T^aii &fiEn T.aii Tato 

Jrxo x*eu nop Axy^ jjeu ijy» 
205 210 


VclX 


irne 


neu 


i9±n 
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215 


rtXS 


AX a 


£>er 


Lys 


WVk'ir' &OTt AvfT T.011 AoTl 

IIXL Ae>II iJtsLI noil XAC 

220 225 


Lgu 




uxy 


Leu 


Arg 
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Ser 


Jiet 


\7j*1 


Xjeu 


r!lii r2l filvr Tl ^ RrTf Qo** 
vvxu \9xy Vrxy xxe: Axy o«?x 

235 240 




ixp 






245 




Tl 


Asn 


Val 


T.eii LiVfi Xlc> Ala Pro Olu 
250 255 




Ala 


Ile


Lys 


Phe 
260 


Met 


Ala 




Glu 


Gln Ile Lys Arg Ala Ile
265 270 


Leu




Gln


Gln


Glu 
275 


Thr 


Leu 


His 


Val 


Gln Glu Arg Phe Val Ala
280 285 


Gly


Ser 


Leu 


Ala 


Gly


Ala 


Thr 


Ala 


Gln


Thr Ile Ile Tyr Pro Met








290 










295 300 


Glu 


Thr 


Leu 


Lys 


Asn 
305 


Trp 


Trp 


Leu 


Gln


Gln Tyr Ser His Asp Ser
310 315 


Ala 


Asp 


Pro 


Gly 


Ile
320 


Leu 


Val


Leu 


Leu 


Ala Cys Gly Thr Ile Ser
325 330 


Ser 


Thr 


Cys 


Gly 


Gln
335 


Ile


Ala 


Ser 


Tyr 


Pro Leu Ala Leu Val Arg 
340 345 


Thr 


Arg 


Met 


Gln


Ala 
350 


Gln


Ala 


Ser 


Ile


Glu Gly Gly Pro Gln Leu

355 360 


Ser 


Met 


Leu 


Gly 


Leu 
365 


Leu 


Arg 


His 


Ile


Leu Ser Gln Glu Gly Met
370 375 


Arg 


Gly 


Leu 


Tyr 


Arg 
380 


Gly 


Ile


Ala 


Pro 


Asn Phe Met Lys Val Ile
385 390 


Pro 


Ala 


Val 


Ser 


Ile
395 


Ser 


Tyr 


Val 


Val 


Tyr Glu Asn Met Lys Gln
400 405 


Ala 


Leu 


Gly 


Val 


Thr 


Ser 


Arg 


Leu 


Glu 


Tyr Ser Gly Ser Ile Ser
410


415 




420 


Asp His Cys Asn Leu 


Cys Leu Pro Gly Ser Ser Asp Ser


Pro 


Ala 


425 


430 




435 


Ser Ala Ser Arg Val 


Ala Gly Ile Thr Gly Phe His His


Val 


Ala 


440 


445 




450 


Gln Ala His Leu Gly


Leu Val Gly Ser Arg Asn Ser Ala 


Ala 


Phe 


455 


460 




465 


Ser Leu Pro Thr Cys 


Trp Asp Tyr Arg Lys Pro Val Val 


Met 


Pro 


470 


475 
Arg
Glu


Ser Glu Gln Glu Pro Leu Leu Gly
Pro
Ser
Glu


Trp Asp Ile Leu Glu Thr Glu Glu
Tyr


Lys 


Ser 


Arg 


Trp 


Arg 
40 45


Met 


Phe 
Ser


Ser 


Val


Gly 


Phe Ser Val Val Met Met Ser Ile
Trp


Pro 


Tyr 


Leu 


Gln


Lys 


Ile


Asp Pro Thr Ala Asp Thr Ser Phe 
70 75


Leu 


Gly 


Trp 


Val 


Ile


Ala 


Ser 


Tyr Ser Leu Gly Gln Met Val Ala
Ser


Pro 


Ile


Phe 


Gly 

95 


Leu 


Trp 


Ser Asn Tyr Arg Pro Arg Lys Glu
100 105 
Leu


Ile


Val 


Ser 
110 


Ile


Leu 


Ile Ser Val Ala Ala Asn Cys Leu
115 120 


Tyr 


Ala 


Tyr 


Leu 


His 


Ile


Pro 


Ala Ser His Asn Lys Tyr Tyr Met 
Leu


Val 


Ala 


Arg 


Gly 


Leu 


Leu 


Gly Ile Gly Ala Gly Asn Val Ala








140 






145 150 


Val 


Val 


Arg 


Ser 


Tyr 


Thr 


Ala 


Gly Ala Thr Ser Leu Gln Glu Arg
160 165


Thr 


Ser 


Ser 


Met 


Ala 
170 


Asn 


Ile


Ser Met Cys Gln Ala Leu Gly Phe
175 180 


Ile


Leu 


Gly 


Pro 


Val


Phe 


Gln


Thr Cys Phe Thr Phe Leu Gly Glu 








185 






190 195 


Lys 


Gly 


Val 


Thr 


Trp 


Asp 


Val 


Ile Lys Leu Gln Ile Asn Met Tyr






200 
Thr


Thr 


Pro 


Val 


Leu 
215 


Leu 


Ser 


Ala Phe Leu Gly Ile Leu Asn Ile

220 225 


Ile


Leu 


Ile


Leu 


Ala 
230 


Ile


Leu 


Arg Glu His Arg Val Asp Asp Ser 
235 240 


Gly 


Arg 


Gln


Cys 


Lys 


Ser 


Ile


Asn Phe Glu Glu Ala Ser Thr Asp 






245 






250 255 


Glu 


Ala 


Gln


Val


Pro 


Gln


Gly 


Asn Ile Asp Gln Val Ala Val Val










260 




265 270 


Ala 


Ile


Asn 


Val 


Leu 
275 


Phe 


Phe 


Val Thr Leu Phe Ile Phe Ala Leu
280 285 


Phe 


Glu 


Thr 


Ile


Ile
290 


Thr 


Pro 


Leu Thr Met Asp Met Tyr Ala Trp 
295 300 


Thr 


Gln


Glu 


Gln


Ala 
305 


Val 


Leu 


Tyr Asn Gly Ile Ile Leu Ala Ala
310 315 
Leu


Gly 


Val Glu 


Ala 

320 


Val 


Val 
Lys
Gly
Ala


Val 


Val 


Trp Val 


Gly 
Glu
Glu
Gly
Met
Gly
Thr
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T1 fifc 
Ile
Thr


Thr 
Val


Cys


Asn 


Leu 




430 
Pro
Gln
Gly
Ala
Val
Gly
Ser
55
Val
135


Trp 


Tyr 


Phe 


Val
Ile
Ser


Leu 


Ile


Ala Gly 
Ile


Trp 


Val 
Ile


Val 


Pro Gly 
Thr


Phe 
230
Val
Met Gln Ala His Asn Thr Glu Asn
Val Leu Leu His Thr Ser Arg Lys
Lys Glu Glu Ile Arg Ala His Arg
Glu Ser Gly Leu


Glu 
Gln
Met
750


Val 


Gln


Glu 


Pro Leu 








765 


Arg 


Arg 


Ser 


Gly Gly 








780 


Thr 


Ser 


Asp 


Ile Asp








795 


Asp 


Leu 


Leu 


Ser Tyr 








810 


Pro 


Leu 


Trp 


Arg Thr 








825 


Leu 


Tyr 


Thr 


Cys Ala 








840 


Ala 


Phe 


Ser 


Phe Phe 








855 


Met 


Thr 


Ser 


Trp He 








870 


Thr 


Tyr 


Arg 


Lys Asn 








885 


Gin 


Ala 


Val 


Ala Gly 








900 


Ala 


He 


Met 


Gin Glu 








915 


He 


Leu 


Pro 


Asp Asp 








930 


Ser 


Leu 


Leu 


Glu Val 








945 


Asn 


Gin 


Pro 


Thr He 








960 


Ala 


Leu 


Asn 


Gly Asp 








975 


Ser 


Pro 


Asn 


Thr Ser 








990 


Gin 


Pro 


Gin 


Thr Leu 








1005 


Ser 


Leu 


Gly 


Gin Val 








1020 


Ala 


Thr 


Glu 


Gly Leu 








1035 


Phe 


Ala 


Glu 


Gly Glu 








1050 


Asp 


Asp 


Ser 


Glu Pro 








1065 


Val 


Tyr 


Thr 


Glu Gly 








1080 


Val 


Arg 


Val 


Asn He 








1095 


Ser 


Phe 


Gin 


Thr Ala 
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1100 


Ser 


Arg 


Ser 


Val He 








1115 


Val 


He 


Arg 


Thr Phe 








1130 


Lys 


He 


He 


Gly Gin 








1145 


Gly 


Gin 


Leu 


Phe Phe 






1160 


Val 


His 


Leu 


Leu Asp 








1175 


Gin 


Val 


He 


Leu Tyr 








1190 


Gly 


He 


Ala 


Leu Leu 






1205 


Val 


Glu 


Ala 


Ser Asp 








1220 


Ser 


Ser 


Arg 


Phe Val 








1235 


Leu 


Phe 


He 


Asn Arg 








1250 


Thr 


Tyr 


Thr 


Thr val 






1265 


Val 


Gly 


Asn 


Leu Ala 






1280 


Gly 


Phe 


Leu 


He Leu 








1295 


He 


Thr 


He 


Leu Glu 








1310 


Leu 


Val 


Asn 


Leu Thr 








1325 


Ser 


Phe 


Pro 


Pro Arg 








1340 


He 


He 


Asp 


Ala Asn 








1355 


Gin 


Ser 


Arg 


Phe Glu 








1370 


Val 


Ala 


Gin 


Arg Ser 








1385 


Val 


Tyr 


Ala 


Gin Asn 






1400 


Phe 


Thr 


Pro 


Met Xle 








1415 


Asn 


Val 


Asn 


He Met 








1430 


Glu 


Lys 


Phe 


Gin Leu 








1445 


Leu 


Gly 


Lye 


Asn Thr 








1460 


Asp 


Gly 


Pro 


Gly Val 








1475 


Leu 


Arg 


Glu 


Pro Thr 








1490 


Leu 


Tyr 


He 


Val Arg 








1505 


Thr 


Val 


Gin 


Phe He 








1520 


Lys 


Asp 


Leu 


Thr Pro 








1535 


Val 


Arg 


Phe 


Lys Ala 








1550 


Pro 


Glu 


Met 


Asp Glu 








1565 









1105 


Gly 


His 


Glu 


Gly Glu 








1120 


Pro 


Gly 


Arg 


Gly Asn 








1135 


Asn 


Gin 


Glu 


Leu Asn 








1150 


Pro 


Glu 


Gly 


Ser Leu 








1165 


Asp 


Asn 


He 


Pro Glu 








1180 


Asp 


Val 


Arg 


Thr Gin 








1195 


Asp 


Ala 


Gin 


Gly Tyr 








1210 


Glu 


Pro 


His 


Gly Val 








1225 


Leu 


Leu 


Gin 


Glu Ala 








1240 


Glu 


Phe 


Gly 


Ser Leu 








1255 


Pro 


Gly 


Met 


Leu Ser 








1270 


Glu 


Pro 


Glu 


Val Asp 








1285 


Glu 


Glu 


Gly 


Glu Thr 








1300 


Asp 


Asp 


Val 


Pro Glu 








1315 


Tyr 


val 


Gly 


Leu Thr 








1330 


Leu 


Asp 


Ser 


Glu Gly 








1345 


Asp 


Gly 


Ala 


Arg Gly 








1360 


Val 


Asn 


Glu 


Thr His 








1375 


Arg 


Glu 


Pro 


Leu Gly 








1390 


lieu 


Glu 


Ala 


Gin Val 








1405 


Leu 


His 


Phe 


Ala Asp 








1420 


He 


Leu 


Asp 


Asp Asp 








1435 


He 


Leu 


Thr 


Asn Pro 








1450 


He 


Ala 


Leu 


He He 








1465 


Leu 


Ser 


Phe 


Asn Asn 








1480 


Ala 


Leu 


Tyr 


Val Gin 








1495 


Glu 


Pro 


Ala 


Gin Gly 








1510 


Val 


Thr 


Glu 


Val Asn 








1525 


Ser 


Lys 


Gly 


Tyr He 








1540 


Leu 


Gin 


lie 


Ser Ala 








1555 


Tyr 


Phe 


Val 


Cys Thr 








1570 









1110 


He 


Leu 


Gin 


Phe His 








1125 


Val 


Thr 


val 


Asn Trp 








1140 


Phe 


Ala 


Asn 


Phe Ser 








1155 


Asn 


Thr 


Thr 


Leu Phe 








1170 


Glu 


Lys 


Glu 


Val Tyr 








1185 


Gly 


Val 


Pro 


Pro Ala 








1200 


Ala 


Ala 


Val 


Leu Thr 








1215 


Leu 


Asn 


Phe 


Ala Leu 








1230 


Asn 


He 


Thr 


He Gin 








1245 


Gly 


Ala 


He 


Asn Val 








1260 


Leu 


Lys 


Asn 


Gin Thr 








1275 


Phe 


Val 


Pro 


He He 








1290 


Ala 


Ala 


Ala 


He Asn 








1305 


Leu 


Glu 


Glu 


Tyr Phe 








1320 


Met 


Ala 


Ala 


Ser Thr 








1335 


Leu 


Thr 


Ala 


Gin Val 








1350 


Val 


He 


Glu 


Trp Gin 








1365 


Gly 


Ser 


Leu 


Thr Leu 








1380 


His 


val 


Ser 


Leu Phe 








1395 


Gly 


Leu 


Asp 


Tyr He 








1410 


Gly 


Glu 


Arg 


Tyr Lys 








1425 


He 


Pro 


Glu 


Gly Asp 








1440 


Ser 


Pro 


Gly 


Leu Glu 








14SS 


Val 


Leu 


Ala 


Asn Asp 








1470 


Ser 


Glu 


Hie 


Phe Phe 








1485 


Glu 


Ser 


val 


Ala Val 








1500 


Leu 


Phe 


Gly 


Thr Val 








1515 


Ser 


Ser 


Asn 


Glu Ser 








1530 


Val 


Leu 


Glu 


Glu Gly 








1545 


He 


Leu 


Asp 


Thr Glu 








1560 


Leu 


Phe 


Asn 


Pro Thr 








1575 
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Gly 


Gly 


Ala 


Arg heu 








1580 


Leu 


Gin 


Asn 


Gin Ala 








1595 


Glu 


Asn 


Arg 


Ala Thr 








1610 


Val 


Tyr 


Leu 


Asn Val 








1625 


Leu 


Asn 


He 


Glu Asn 








1640 


Phe 


Ser 


Pro 


Tyr Phe 








1655 


Lys 


Pro 


Ser 


Leu Asn 








1670 


XiGU 


Phe 


Leu 


Val Gin 








1685 






Phe 


Thr Ser 








1700 


Gin 


Arg 


AST3 










1715 


R1 V 




Phe 


Val T.oii 

vox 








1730 


Thr 


Val 


Ala 










1745 




Gin 


Ala 


Asn Ala 








1760 


Glv 




Glv 


Phe He 








1775 


Thr 


Glu 


Val 


Glu Ala 








1790 


Phe 


Ala 


Lys 


Asn Val 








1805 


Phe 


lie 




Glu Met 








1820 


Val 


Asp 


Phe 


Ala Ala 








1835 




Glv 


He 


Ala Hifi 








1850 








TiTFk Ann 








1865 


Glu 


Val 


Pro 


Rrt Ala 








1880 


Leu 


Asn 


Ser 


Ser LvB 








1895 


His 


He 


Tvr 


Glu Leu 








1910 


Pro 


Ser 


Ser 


Gly Glu 








1925 


Thr 


He 


Ala 


Val Asn 








1940 


Glu 


Ser 


Phe 


Lys Val 








1955 


He 


Gly 


He 


Asn Asp 








1970 


Aep 


Ala 


Tyr 


Gly He 








1985 


Gin 


Val 


Glu 


Glu Met 








2000 


Glu 


Arg 


Leu 


Lys Gly 








2015 


Ala 


Asp 


Gly 


Ser He 








2030 


Glu 


Lys 


Arg 


Met Ser 



Gly 


Val 


His 


Val Gin 






1585 


Pro 


Leu 


Gly 


Leu Phe 








1600 


Ser 


He 


Asp 


He Glu 








1615 


Ser 


Arg 


Thr 


Asn Gly 








1630 


Pro 


Lvs 


Thr 


Cys Glu 








1645 


Val 


He 


Thr 


His Glu 








1660 


Ser 


Val 


Phe 


Thr Phe 








1675 


Thr 


He 


He 


Tlf» T.f»iJ 








1690 


Asp 


Ser 


Gin 














T ■■ 
AjSU 


Thr 


Gin Val 








1720 


His 


Gin 


Lys 










1735 


Asn 


Lys 


Glv 


wX_y «9c?Ji 








1750 




Leu 


Asn 


S/CX JUCU 








1765 


Asn 


Phe 


Gin 


Glu Val 








1780 








AIa ASir) 
Axa noli 








1795 


Phe 


x^eu 


Glv 


AstD Gin 








1810 




Gin 


Cot* 

OBJ. 










1825 


Val 


Asn 




He His 








1840 


tie 




Leu 


He Gly 








1855 




Gill 












1870 


Tvr 


Asp 


Val 


Aid fif^r 








1885 


Asil 


Leu 


He 


Ala Leu 








1900 


Ala 




He 










1915 


Leu 


He 


Phe 


Glu Pro 








1930 


He 


Leu 


Asp 


Asp Thr 








1945 


Gin 


Ifeu 


Lvs 


Asn Pro 








1960 


Ser 


Val 


Thr 


He Thr 








1975 


Val 


Ala 


Phe 


Ala Gin 








1990 


Glu 


Gin 


Asp 


Ser Leu 








2005 


Thr 


Tyr 


Gly 


Arg He 








2020 


Ser 


Asp 


He 


Phe Pro 








2035 


Ala 


Lys 


He 


Leu Phe 



Thr 


Leu 


He 


Thr Val 








1590 


Ser 


He 


Ser 


Ala Val 








1605 


Glu 


Ala 


Asn 


Arg Thr 








1620 


He 


Asp 


Leu 


Ala Asp 








1635 


Ala 


Phe 


Asn 


He Gly 








1650 


Glu 


Arg 


Asn 


Glu Glu 








1665 


Thr 


Ser 


Gly 


Phe Lys 








1680 


Glu 


Ser 


Ser 


Gin Val 








1695 


Leu 


He 


He 


Ala Ser 








1710 


Phe 


Arcr 

"Xy 


TiTD 

AX^ 


Asn Gl V 








1725 


Val 


nxv^ 


Glv 

^^J.y 


Val Leu 








1740 


Val 


Phe 


Leu 


Ala He 








1755 


Leu 


Phe 


Ara 


Trp Ser 








1770 


Pro 


Val 


Ser 


Gly Thr 








1785 


Asp 


He 


Tvr 


Leu He 








1800 


Asn 


Ser 


He 


Asp lie 








1815 


Arg 


Tvr 


Phe 


Gin Ser 








1830 


Ser 


Phe 


Thr 


Pro Ala 








1845 


Gin 


Asp 


Met 


Ser Ala 








I860 


Phe 


Ser 


Phe 


Val Leu 








1875 


Val 


Thr 


Val 


Lys Ser 








1890 


Val 


Glv 


Ala 


His Ser 








1905 


His 


Ser 


Asp 


Phe He 








1920 


Gly 


Glu 


Arg 


Glu Ala 








1935 


Val 


Pro 


Glu 


Lys Glu 








1950 


Lys 


Gly 


Gly 


Ala Glu 








1965 


He 


Leu 


Ser 


Asn Asp 








1980 


Asn 


Ser 


Leu 


Tyr Lys 








1995 


Val 


Thr 


Leu 


Asn Val 








2010 


Thr 


He 


Ala 


Trp Glu 








2025 


Thr 


Ser 


Gly 


Val val 








2040 


Thr 


Glu 


Gly 


Gin val 
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2045 2050 2055 

Leu Ser Thr Ile Thr Leu Thr Ile Leu Ala Asp Asn Ile Pro Glu 

2060 2065 2070 

Leu Ser Glu Val Val Ile Val Thr Leu Thr P^g Ile Thr Thr Glu 

2075 2080 2085 

Gly Val Glu Asp Ser Tyr Lys Gly Ala Thr Ile Asp Gln Asp Arg 

2090 2095 2100 

Ser Lys Ser Val Ile Thr Thr Leu Pro Asn Asp Ser Pro Phe Gly 

2105 2110 2115 

Leu Val Gly Trp Arg Ala Ala Ser Val Phe Ile Arg Val Ala Glu 

2120 2125 2130 

Pro Lys Glu Asn Thr Thr Thr Leu Gln Leu Gln Ile Ala Arg Asp 

2135 2140 2145 

Lys Gly Leu Leu Gly Asp Ile Ala Ile His Leu Arg Ala Gln Pro 

2150 2155 2160 

Asn Phe Leu Leu His Val Asp Asn Gln Ala Thr Glu Asn Glu Asp 

2165 2170 2175 

Tyr Val Leu Gln Glu Thr Ile Ile Ile Met Lys Glu Asn Ile Lys 

2180 2185 2190 

Glu Ala His Ala Glu Val Ser Ile Leu Pro Asp Asp Leu Pro Glu 

2195 2200 2205 

Leu Glu Glu Gly Phe Ile Val Thr Ile Thr Glu Val Asn Leu Val 

2210 2215 2220 

Asn Ser Asp Phe Ser Thr Gly Gln Pro Ser Val Arg Arg Pro Gly 

2225 2230 2235 

Met Glu Ile Ala Glu Ile Met Ile Glu Glu Asn Asp Asp Pro Arg 

2240 2245 2250 

Gly Ile Phe Met Phe His Val Thr Arg Gly Ala Gly Glu Val Ile 

2255 2260 2265 

Thr Ala Tyr Glu Val Pro Pro Pro Leu Asn Val Leu Gln Val Pro 

2270 2275 2280 

Val Val Arg Leu Ala Gly Ser Phe Gly Ala Val Asn Val Tyr Trp 

2285 2290 2295 

Lys Ala Ser Pro Asp Ser Ala Gly Leu Glu Asp Phe Lys Pro Ser 

2300 2305 2310 

His Gly Ile Leu Glu Phe Ala Asp Lys Gln Val Thr Ala Met Ile 

2315 2320 2325 

Glu Ile Thr Ile Ile Asp Asp Ala Glu Phe Glu Leu Thr Glu Thr 

2330 2335 2340 

Phe Asn Ile Ser Leu Ile Ser Val Ala Gly Gly Gly Arg Leu Gly 

2345 2350 2355 

Asp Asp Val Val Val Thr Val Val Ile Pro Gln Asn Asp Ser Pro 

2360 2365 2370 

Phe Gly Val Phe Gly Phe Glu Glu Lys Thr Val Met Ile Asp Glu 

2375 2380 2385 

Ser Leu Ser Ser Asp Asp Pro Asp Ser Tyr Val Thr Leu Thr Val 

2390 2395 2400 

Val Arg Ser Pro Gly Gly Lys Gly Thr Val Arg Leu Glu Trp Thr 

2405 2410 2415 

Ile Asp Glu Lys Ala Lys His Asn Leu Ser Pro Leu Asn Gly Thr 

2420 2425 2430 

Leu His Phe Asp Glu Thr Glu Ser Gln Lys Thr Ile Val Leu His 

2435 2440 2445 

Thr Leu Gln Asp Thr Val Leu Glu Glu Asp TUrg Arg Phe Thr Ile 
2450 2455 2460 

Gln Leu Ile Ser Ile Asp Glu Val Glu Ile Ser Pro Val Lys Gly 
2465 2470 2475 

Ser Ala Ser Ile Ile Ile Arg Gly Asp Lys Arg Ala Ser Gly Glu 
2480 2485 2490 

Val Gly Ile Ala Pro Ser Ser Arg His Ile Leu Ile Gly Glu Pro 
2495 2500 2505 

Ser Ala Lys Tyr Asn Gly Thr Ala Ile Ile Ser Leu Val Arg Gly 
2510 2515 2520 
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Pro Gly 


He 


Leu Gly 


Glu 


Val 


Thr 


Val 


Phe 


Ttp Arg 


He 


Phe Pro 






2525 








2530 






2535 


Pro Ser 


Val 


Gly Glu 


Phe 


Ala 


Glu 


Thr 


Ser 


Gly Lys 


Leu 


Thr Met 






2540 








2545 






2550 


Arg Asp 


Glu 


Gin 


Ser 


Ala 


Val 


He 


Val 


Val 


lie Gin 


Ala 


Leu Asn 




2555 








2560 






2565 


Asp Asp 


lie 


Pro 


Glu 


Glu 


Lys 


Ser 


Phe Tyr 


Glu Phe 


Gin 


Leu Thr 






2570 








2575 






2580 


Ala Val 


Ser 


Glu Gly 


Gly 


Val 


Leu 


Ser Glu 


Ser Ser 


Ser 


Thr Ala 






2585 








2590 






2595 


Asn lie 


Thr 


Val 


Val 


Ala 


Ser 


Asp 


Ser 


Pro 


Tyr Gly 


Arg 


Phe Ala 






2600 








2605 






2610 


Phe Ser 


His 


Glu Gin 


Leu 


Arg 


Val 


Ser Glu 


Ala Gin 


Arg 


Val Asn 






2615 








2620 






2625 


He Thr 


He 


He Arg 


Ser 


Ser 


Gly 


Asp Phe 


Gly His 


Val 


Arg Leu 






2630 








2635 






2640 


Trp Tyr 


Lys 


Thr 


Met 


Ser 


Gly 


Thr 


Ala 


Glu 


Ala Gly 


Leu 


Asp Phe 






2645 








2650 






2655 


Val Pro 


Ala 


Ala Gly 


Glu 


Leu 


Leu 


Phe Glu 


Ala Gly 


Glu 


Met Arg 






2660 








2665 






2670 


Lys Ser 


Leu 


His 


Val 


Glu 


He 


Leu 


Asp Asp 


Asp Tyr 


Pro 


Glu Gly 






2675 








2680 






2685 


Pro Glu 


Glu 


Phe 


Ser 


Leu 


Thr 


He 


Thr Lys 


Val Glu 


Leu 


Gin Gly 






2690 








2695 






2700 


Arg 
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<400> 31 






















Met Asp 


Thr 


Pro 


Arg 


Gly 


He 


Gly 


Thr 


Phe 


Val Val 


Trp 


Asp Tyr 


1 






5 










10 






15 


Val val 


Phe 


Ala 


Gly 


Met 


Leu 


Val 


He 


Ser 


Ala Ala 


He 


Gly He 








20 










25 






30 


Tyr Tyr 


Ala 


Phe 


Ala 


Gly 


Gly 


Gly 


Gin 


Gin 


Thr Ser 


Lys 


Asp Phe 








35 










40 






45 


Leu Met 


Gly 


Gly 


Arg 


Arg 


Met 


Thr 


Ala 


Val 


Pro Val 


Ala 


Leu Ser 








50 










55 






60 


Leu Thr 


Ala 


Ser 


Phe 


Met 


Ser 


Ala 


val 


Thr 


Val Leu 


Gly 


Thr Pro 








65 










70 






75 


Ser Glu 


Val 


Tyr 


Arg 


Phe 


Gly 


Ala 


He 


Phe 


Ser He 


Phe 


Ala Phe 








80 










85 






90 


Thr Tyr 


Phe 


Phe 


val 


Val 


Val 


He 


Ser 


Ala 


Glu Val 


Phe 


Leu Pro 








95 










100 






105 


Val Phe 


Tyr 


Lys 


Leu 


Gly 


He 


Thr 


Ser 


Thr 


Tyr Glu 


Tyr 


Leu Glu 








110 










115 






120 


Leu Arg 


Phe 


Asn 


Lys 


Cys 


Val 


Arg 


Leu 


Cys 


Gly Thr 


Val 


Leu Phe 








125 










130 






135 


lie val 


Gin 


Thr 


He 


Leu 


Tyr 


Thr 


Gly 


He 


Val He 


Tyr 


Ala Pro 








140 










145 






150 


Ala Leu 


Ala 


Leu 


Asn 


Gin 


Val 


Thr 


Gly 


Phe 


Asp Leu 


Trp 


Gly Ala 








155 










160 






165 


val Val 


Ala 


Thr 


Gly 


Val 


Val 


Cys 


Thr 


Phe 


Tyr Cys 


Thr 


Leu Gly 








170 










175 






180 


Gly Leu 


Lys 


Ala 


Val 


He 


Trp 


Thr 


Asp 


Val 


Phe Gin 


Val 


Gly He 






185 










190 






195 
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Met Val Ala Gly Phe Ala Ser Val Ile Ile Gln Ala Val Val Met 

200 205 210 

Gln Gly Gly Ile Ser Thr Ile Leu Asn Asp Ala Tyr Asp Gly Gly 

215 220 225 

Arg Leu Asn Phe Trp Asn Phe Asn Pro Asn Pro Leu Gln Arg His 

230 235 240 

Thr Phe Trp Thr Ile Ile Ile Gly Gly Thr Phe Thr Trp Thr Ser 

245 250 255 

Ile Tyr Gly Val Asn Gln Ser Gln Val Gln Arg Tyr Ile Ser Cys 

260 265 270 

Lys Ser Arg Phe Gln Ala Lys Leu Ser Leu Tyr Ile Asn Leu Val 

275 280 285 

Gly Leu Trp Ala Ile Leu Thr Cys Ser Val Phe Cys Gly Leu Ala 

290 295 300 

Leu Tyr Ser Arg Tyr His Asp Cys Asp Pro Trp Thr Ala Lys Lys 

305 310 315 

Val Ser Ala Pro Asp Gln Leu Met Pro Tyr Leu Val Leu Asp Ile 

320 325 330 

Leu Gln Asp Tyr Pro Gly Leu Pro Gly Leu Phe Val Ala Cys Ala 

335 340 345 

Tyr Ser Gly Thr Leu Ser Thr Val Ser Ser Ser Ile Asn Ala Leu 

350 355 360 

Ala Ala Val Thr Val Glu Asp Leu Ile Lys Pro Tyr Phe Arg Ser 

365 370 375 

Leu Ser Glu Arg Ser Leu Ser Trp Ile Ser Gln Gly Met Ser Val 

380 385 390 

Val Tyr Gly Ala Leu Cys Ile Gly Met Ala Ala Leu Ala Ser Leu 

395 400 405 

Met Gly Ala Leu Leu Gln Ala Ala Leu Ser Val Phe Gly Met Val 

410 415 420 

Gly Gly Pro Leu Met Gly Leu Phe Ala Leu Gly Ile Leu Val Pro 

425 430 435 

Phe Ala Asn Ser Ile Gly Ala Leu Val Gly Leu Met Ala Gly Phe 

440 445 450 

Ala Ile Ser Leu Trp Val Gly Ile Gly Ala Gln Ile Tyr Pro Pro 

455 460 465 

Leu Pro Glu Arg Thr Leu Pro Leu His Leu Asp Ile Gln Gly Cys 

470 475 480 

Asn Ser Thr Tyr Asn Glu Thr Asn Leu Met Thr Thr Thr Glu Met 

485 490 495 

Pro Phe Thr Thr Ser Val Phe Gln Ile Tyr Asn Val Gln Arg Thr 

500 505 510 

Pro Leu Met Asp Asn Trp Tyr Ser Leu Ser Tyr Leu Tyr Phe Ser 

515 520 525 

Thr Val Gly Thr Leu Val Thr Leu Leu Val Gly Ile Leu Val Ser 

530 535 540 

Leu Ser Thr Gly Gly Arg Lys Gln Asn Leu Asp Pro Arg Tyr Ile 

545 550 555 

Leu Thr Lys Glu Asp Phe Leu Ser Asn Phe Asp Ile Phe Lys Lys 

560 565 570 

Lys Lys His Val Leu Ser Tyr Lys Ser His Pro Val Glu Asp Gly 

575 580 585 

Gly Thr Asp Asn Pro Ala Phe Asn His Ile Glu Leu Asn Ser Asp 

590 595 600 

Gln Ser Gly Lys Ser Asn Gly Thr Arg Leu 

605 610 
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<221> misc_feature 
<223> Incyte ID No: 

<400> 32 





Glv 


Phe 


Asp 


Val 


1 








5 


* 


Gin 


He 


Cys 


Leu 
20 


Leu 




IT X V 


Ann 


35 


Se3c 


IliS 


Arg 


Cys 


Trp 
50 


Asp 


Asn 


Asp 


Thr 


laXy 

65 


Ser 


Tl A 


Pro 


Leu 


Asp 
80 


pne 


lie 


nXS 


Pro 


95 


Pro 


ABn 


Tiir 


Asn 


bXU 

110 




Tyx 


Asp 


Arg 


Ser 
125 


Asp 


Leu 


VaX 


Cys 


uxu 
140 


lieu 


pne 


nee 


ALBL 


oxy 
155 


Leu 


Ser 


Asp 


Arg 


vax 
170 


Leu 


\jin 


Leu 


Axa 


xxe 
185 


Phe 


Leu 


vai 


Tyr 


Cys 

200 


Met 


Tlur 


xie 


Xjeu 


ijiXy 
215 


Leu 


Pro 


Arg 




Arg 
230 


Tyr 


j>er 


vax 


\»iy 


(ciXn 
245 




Asp 


Trp 


nxs 


Tl a 

xxe 

260 




Leu 


irne 


Leu 


Ser 
275 


Leu 




Tl t> 


Asn 


Asn 
290 


Arcf 


VQ J- 


Hi « 




305 


Thy 


X JHU. 


(^1 11 

vvJt LI 


Leu 


Vd J- 

320 






He 


Lys 


Thr 
335 


Leu 




Met 




Val 
350 


Thr 


Val 


Pro 


Phe 


Tyr 
365 


Ser 


Asn 


Val 


Ser 


Leu 
380 


Thr 


Ala 


Arg 


Cys 


Val 
395 


Arg 


He 


Ser 


Gin 


He 
410 


Leu 


val 


Asn 


Thr 


Phe 
425 


val 


Leu 


Ala 


Thr 


Leu 
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Leu 


Leu 


Asp 


Gin 


Val 
10 


•L JiC 


Ala 


Ph«» 

Ir lie 




25 


vax 


Leu 


v»XU 


Asn 


xrne 
40 


If b1 

vax 


Pro 


Leu 


Leu 


Asp 
55 


Tnr 


L6U 


Ser 


Lys 


Asp 

70 


Ser 


Asn 


Leu 


Arg 


Pro 
85 


Trp 


Gin 


Leu 


Leu 


His 
100 


Pro 


Asp 


Thr 


Glu 


Pro 
115 


Ser 


Pne 


Leu 


Ser 


Thr 
130 


Ser 


Gin 


Ser 


Xfeu 


Lys 
145 


Ser 


Leu 


Leu 


GXy 


Gly 
160 


Gly 


Arg 


Lys 


xxe 


T 1 «. 

lie 
175 


ser 


Asn 


Thr 


Cys 


Ala 
190 


ixe 


Leu 


Arg 


Pne 


Leu 

205 


Asn 


Thr 


Pne 


He 


Leu 
220 


Ser 


Met 


Thr 


Tl «. 

xXe 


Met 

235 


Met 


Leu 


Leu 


Gly 


Gly 
250 


Leu 


Gin 


Leu 


Thr 


vai 
265 


Ser 


Trp 


Lys 


wee 


vax 
280 




L6U 


Asp 


laXU 


laxy 
295 


Asn 


f21 vr 


Lys 


Lys 


Asn 
310 


si 








Xfj^Ef 

325 




TIa 


Php 




340 


Ph^ 


CSl V 


Leu 




355 


Gly 


Leu 


He 


Leu 


Asn 
370 


Phe 


Gin 


He 


Leu 


Cys 
385 


Ser 


Leu 


Leu 


Thr 


Leu 
400 


Leu 


Phe 


Thr 


Phe 


Pro 
415 


Leu 


Pro 


Gin 


Glu 


Met 
430 


Gly 


He 


Gly 


Ser 


val 



Glv 


Glv 


Met 


Glv 
\axy 


15 


lie 


Thr 


Asn 


He 


30 




Ala 




X nr 


Pr-r% 

45 


Asn 


Asp 


Ji nr 


WaI 

vax 


oer 
60 


Asp 


Leu 


Leu 


Arg 


Tl A 

xxe 
75 


r»l Tl 


Lys 


Cys 


oxn 


Arg 
90 


Leu 


Asn 


Giy 


rnr 


Fne 
105 


Cys 


VaX 


Asp 


Gly 


Trp 
120 


Tl A. 

xxe 


vax 


Thr 




Trp 
135 


Ser 


nec 


vax 


%3±Xl 


Ser 
150 


Lieu 


Tl A 

xxe 


'lyr 


/^i ■■■■ 

tixy 


His 

165 


Cys 


Lys 


Leu 


Cys 


Phe 
180 


AX a 


Fne 


K 1 a 

Axa 


Pro 


inr 
195 


AX a 


fexy 


Fne 


3er 


Tnr 
210 


Ser 


Leu 


GiU 


Trp 


Tnr 
225 


tr>=m 1 

vax 


Leu 


Leu 


Cys 


Caw 

&>er 
240 


Leu 


Axa 


Fne 


Axa 


xxe 

255 


Ser 


Thr 


Pro 


Tl A 

xxe 


Tl A 

xxe 
270 


IjXU 


Ser 


Al s 


Arg 


Trp 
285 


Leu 


Lys 


r*! 11 


Leu 


Arg 
300 


Thr 


Glu 


Glu 


Thr 


Leu 

315 


X*jf D 




Leu 


Asn 


Ala 
330 


Phe 


Arg 


Ala 


Pro 


Lys 
345 


Val 


Aror 


Phe 


Ala 


He 
360 


Leu 


Gin 


His 


Leu 


Gly 
375 


Gly 


Ala 


Val 


Thr 


Phe 
390 


Asn 


His 


Met 


Gly 


Arg 
405 


Val 


Gly 


Leu 


Phe 


He 
420 


Gin 


He 


Leu 


Arg 


Val 
435 


Ser 


Ala 


Ala 


Ser 


Asn 
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440 






445 










450 


Ser 


Ala 


Ser 


Val His 


His 


Asn 


Glu Leu Val 


Pro 


Thr 


He 


Leu 


Arg 








455 






460 










465 


Ser 


Thr 


Val 


Ala Gly 


He 


Asn 


Ala Val Ser 


Gly 


Arg 


Thr 


Gly 


Ala 








470 






475 
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Ala 


Leu 


Ala 


Pro Leu 


Leu 


Met 


Thr Leu Met 


Ala 


Tyr 
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Pro 


His 
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Pro 
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Val Phe Pro 


He 


Leu 
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Val 


Pro 
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Leu 


Leu Leu 


Pro 


Glu 


Thr Arg Asp 


Leu 


Pro 


Leu 


Pro 


Asn 
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520 










525 


Thr 
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Gin 


Asp Val 
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Asp Arg Lys 


Asp 


Ser 


Arg 


Asn 


He 
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Asp Thr 
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Met 


Lys val Thr 


Gin 


Phe 














545 






550 













<210> 33 
<211> 2365 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7484831CB1 

<400> 33 

atggagcctg gtgacgcggc gcgccctggc tcgggtcggg ctaccggggc gccgccgccg 60 
cggctgctgc tgctgccgct gctgctgggt tgggggctgc gagtcgcggc cgcggcctcg 120 
gcctcctcct ctggggcggc ggccgaggac agcagcgcca tggaggagct cgctactgag 180 
aaggaggcgg aggagagcca ccggcaagac agcgtgagcc tgctcacctt catcctgctg 240 
ctcacgctca ccatcctcac catctggctc ttcaagcacc gccgggtgcg ctttctgcac 300 
gagaccgggc tggccatgat ctatgggctc atcgttgggg tgatcctgag gtatggtacc 360 
cctgctacca gtggccgtga caaatcactc agctgcactc aggaagacag ggccttcagt 420 
accttattag tgaatgtcag cggaaagttc ttcgaataca ctctgaaagg agaaatcagt 480 
cctggcaaga tcaacagcgt agagcagaat gatatgctac ggaaggtaac attcgatcca 540 
gaagtatttt tcaacattct tctgcctcca attatttttc atgctggata cagcttaaag 600 
aagagacact ttttcagaaa tcttggatct atactggcct atgccttctt ggggactgct 660 
gtttcatgct tcattattgg aaatctcatg tatggtgtgg tgaagctcat gaagattatg 720 
ggacagctct cagataaatt ttactacaca gattgtctct tttttggagc aatcatctct 780 
gccactgacc cagtgactgt gctggcgata tttaatgaat tgcatgcaga cgtggatctt 840 
tacgcacttc tttttggaga gagcgtccta aatgatgctg ttgccattgt actgtcctcg 900 
tctattgttg cctaccagcc agcgggactg aacactcacg cctttgatgc tgctgccttt 960 
tttaagtcag ttggcatttt tctaggtata tttagtggct cttttaccat gggagctgtg 1020 
actggtgttg tgactgctct agtgactaag tttaccaaac tgcactgctt ccccctgctg 1080 
gagacggcgc tgttcttcct catgtcctgg agcacgtttc tcttggcaga agcctgcgga 1140 
tttacaggtg ttgtagctgt ccttttctgt ggaatcacac aagctcatta cacctacaac 1200 
aatctgtcgg tggaatcaag aagtcgaacc aagcagctct ttgaggtgtt acatttcctg 1260 
gcagagaact tcatcttctc ctacatgggc ctggcactgt ttaccttcca gaagcacgtt 1320 
ttcagcccca ttttcatcat cggagctttt gttgccatct tcctgggcag agccgcgcac 1380 
atctacccgc tctccttctt cctcaacttg ggcagaaggc ataagattgg ctggaatttt 1440 
caacacatga tgatgttttc aggcctcagg ggagcaatgg catttgcgtt ggccatccgt 1500 
gacacggcat cctatgctcg ccagatgatg ttcacgacca ccctitctcat tgtgttcttc 1560 
actgtctgga tcattggagg aggcacgaca cccatgttgt catggcttaa catcagagtt 1620 
ggcgtcgagg agccctccga agaggaccag aatgaacacc actggcagta cttcagagtt 1680 
ggtgttgacc ccgatcaaga cccaccaccc aacaacgaca gctttcaagt cttacaaggg 1740 
gacggcccag attctgccag aggaaaccgg acaaaacagg agagcacatg gatattcagg 1800 
ctgtggtaca gctttgatca caattatctg aagcccatcc tcacacacag tggtccccca 1860 
ctaaccacca cgctccccgc ctggtgtggc ttactagctc gatgtctgac cagtccccag 1920 
gtgtacgata accaagagcc actgagagag gaagactctg atttcatcct gaccgaaggc 1980 
gacctgacat tgacctacgg ggacagcaca gtgactgcaa atggctcctc aagttcgcac 2040 
accgcctcca cgagtctgga gggcagccgg agaacgaaga gcagctcgga ggaagtgctg 2100 
gagcgagacc tgggaatggg agaccagaag gttccgagcc ggggcacccg cctagtgttt 2160 
cccctggaag ataatgcttg actttccccc caagccctgg cgcgatggga ctgaagattg 2220 
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aaaatacatc cacgaacatt tcaacatgga 
atccataaag aagtagttat gaaatgttca 
tttttgatta atctgaggaa aggag 



acgaagaatt atagttcctt ctctggctat 
aaaccaaagg caaatagtgt tatactctta 



2280 
2340 
2365 



<210> 34 
<211> 3400 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2477266CB1 

<400> 34 

agcggcggct cagttgctgc catggagtca 
ctgctaccgg acaccgagcg catccgtcgg 
gcccccgccg ctttgccggc tctctgcgac 
cgccagtttg cggccgtgct gacccgcaga 
gcggagcaac gggagagcct caagtccctg 
cactgtgtga gcctcagcct ggcccagctc 
gaggcctggc cacagctttt gcagctgctt 
gagagagaga tggggctttt gctgctaagt 
caaccccacc accgggagct tcttcggctt 
cctgggctgc tcttctactc cctgcgcact 
gaagatgtgc ctctcgctcg gatgttggtg 
atccccatag atgaggcaaa ggcctgtgag 
tcagaggtgc cggtcatcac cccctacctc 
gctagaaatg tggccctggg caatgcgata 
ttggtcaaag tcaagagcaa ggccttactg 
acccttttcc ccattgtggc tgctgagccc 
gattcagaag aggaagagtt ggagattgag 
gtacaagttg tggacatgct ggcactacac 
atgcccatgt tggaagaggc tttgcggagc 
ctggtgctgg ccgtgctgtc tgacggagct 
ccactgctgc agattgtgtg caagggcctg 
gcgctgtttg ccctgggcca gttctcagaa 
agggaggtaa tgccactgct cctcgcctac 
cacctagcca aggcctgcta tgccctggag 
cagccctacc ttccggagct tatggaatgc 
ccccgggcca aggagctggc tgtgagcgcc 
tcgctgctgc cctacttccc tgccatcatg 
cgtgaggacc ttcagcctgt gcagatccag 
gcagtggggg agcccatgag gccgctggct 
tgcgaccagg tagacgacgc tgacttgcgg 
tcgggtctga tgggtgaggg cctggcgccc 
ctgtcactgc gttccaccga gggcattgtg 
ctgtttgacg atgagagtga tggggaagaa 
gaagaggatg actcagagat ctcagggtac 
gaagacacct gtgctgccgt gggggagatc 
tacatggaaa gtgtctttga agaagtattt 
cggaaggcag cccatgaggc tctgggtcag 
agctgcccct cggaacccaa cactgctgct 
tcctacatgc aggcagtgaa cagggagcgg 
gccctgacag gggtgctccg cagctgtggg 
gctgagctct gtggcgtgct caaggctgtg 
gacgaggagg aggaagagga agatgatgat 
cacgctggag aggccatccc tgccctggca 
ttctttgccg gtttcctgcc attattggtg 
gagaagtcct ttgcagtggg gaccttggca 
gcccagtttg tgtctcggct gctccctgtg 
gaggtgcgaa gcaatgccat cttcgggatg 
gcccaggaac acttccccaa gctgctgggg 
catgatcgtg tccgtgacaa catctgtggg 



gccgggctag agcagctcct acgggagctg 60 
gccacggaac agctccagat cgttcttcgg 120 
ctgctagcct cggcggccga cccccagatc 180 
cgactgaaca cccgctggcg acggctggcg 240 
atcctgacgg ccctgcagag agaaacagag 300 
tcagccacca tttttcgaaa ggaaggcttg 360 
cagcacagta cccacagccc ccacagccca 420 
gtggtggtga cctcccggcc cgaggccttc 480 
ctgaatgaga ctcttggtga ggtgggctct 540 
ctgaccacca tggctcccta cctcagcact 600 
cccaagctga tcatggccat gcagactctg 660 
gcGcttgagg ctttggatga actgttggag 720 
tctgaagtcc tcacattctg cctggaggta 780 
cgcatacgta ttctctgctg cctcactttc 840 
aagaatcgtc tcctgccacc cttgctgcac 900 
ccaccaggcc agttggatcc cgaggaccag 960 
ctgatggggg agactcccaa gcatttcgct 1020 
ctgccccccg agaagctctg tccccagctg 1080 
gagagcccat accagcgcaa agctggactc 1140 
ggcgaccaca tcaggcagag actgctgccc 1200 
gaggacccct cgcaagttgt acgcaatgct 1260 
aacctacagc cccatatcag cagctattca 1320 
ttgaagtcgg tgcctcttgg acacacacac 1380 
aattttgtgg agaacctagg gcccaaggtg 1440 
atgctgcagc ttctgaggaa ccccagcagt 1500 
ctgggagcca ttgctacggc tgcccaggcc 1560 
gagcacctgc gggaattcct gttaacaggc 1620 
agcctggaga cactgggggt gctggcacga 1680 
gaggaatgct gccagctggg tctgggcctc 1740 
cgctgcacgt acagcctatt tgcagcctta 1800 
cacttggaac agatcaccac gctcatgctg 1860 
cctcagtatg acgggagcag ctccttcctt 1920 
gaggaggagc tcatggatga ggatgtggaa 1980 
agcgtggaga atgccttctt cgatgagaag 2040 
tctgtgaaca ccagtgtggc cttccttcca 2100 
aaactgctgg agtgccctca cctgaatgtg 2160 
ttttgctgtg cactgcacaa ggcctgtcaa 2220 
ttgcaggctg ccctggcccg agtcgtgcca 2280 
gaacgccagg tggtgatggc cgtgctggag 2340 
accctcacac tgaagccccc tgggcgcctc 2400 
ctgcagagga agacagcctg tcaggatact 2460 
caggctgaat acgacgccat gttgctggag 2520 
gccgcggctg ggggagactc ctttgcccca 2580 
tgcaagacaa aacagggctg cacagtggca 2640 
gagactattc agggcctggg tgctgcctca 2700 
ctgttgagca ccgcccaaga ggcagacccc 2760 
ggcgtgctgg cagagcatgg gggccaccct 2820 
ctcctttttc ccctcctggc gcgggagcga 2880 
gcacttgccc gcctgttgat ggccagtccc 2940 
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accaggaaac cagagcccca ggtgctggct gccctactgc atgccctgcc actgaaggag 3000 

gacttggagg agtgggtcac cattgggcgc ctcttcagct tcctgtacca gagcagccct 3060 

gaccaggtta tagatgcggc tcccgagctt ctgcgtatct gcagcctcat tctggctgac 3120 

aacaagatcc caccagacac caaggccgca ctgttgctgc tcctgacgtt cctggccaaa 3180 

cagcacaccg acagctttca agcagctctg ggctcactgc ctgttgacaa ggctcaggag 3240 

ctccaggctg tactgggcct ctcctagact gcaggctgca gccagtccag agagaataga 3300 

gcctgcccag gccttaagac cacctctcag cccagttcag ttctgcctta ccaaagattc 3360 

tgagactcat acccatttgg agccagcccc acttgctgcc 3400 

<210> 35 

<211> 4458 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3552033CB1 

<400> 35 

gttgttgcta gtatgatata gatcatgtgt acgtctatgt tagtgctatg tacgttgtta 60 
tagtgtcttg cttgttgtat ctatattgtt attgtttcgt tttgtgtgtg tatatattat 120 
tatcgtttgc cccgtttaat taccctccat ggggccccga gttttatttt ccccccttgg 180 
tgggatgcgt cttcaatcac cgcttttata ccctgatatt cggttcaagg gactttctcc 240 
gggggctctt tcggcaccac cgcaagaacc ggtctcagct caaatcccgg aatgaaggcc 300 
tttaaaaaga agatgtgccc aacagtgaaa ccccgccccc aggccccata ggtttccttg 360 
ggcttctgat aggcccagca ttggccctca cccccagtat ctccggcttc agaattgaga 420 
aaaatgaaag tcgcctctcc cgaaaggaca tccagtctga aaagtggtcc atcagcaaac 480 
acactcaact cagccctacg gatgcttttg ggaccattga gttccaagga ggtggccatt 540 
ccaacaaagc catgtatgtg cgagtatctt ttgatacaaa acctgatctc ctcttacacc 600 
tgatgaccaa ggaatggcag ttggagcttc ccaagcttct catctctgtc catgggggcc 660 
tgcagaactt tgaactccag ccaaaactca agcaagtctt tgggaaaggg ctcatcaaag 720 
cagctatgac aactggagcg tggatattca ctggaggggt taacacaggt gttattcgtc 780 
atgttggcga tgccttgaag gatcatgcct ctaagtctcg aggaaagata tgcaccatag 840 
gtattgcccc ctggggaatt gtggaaaacc aggaggacct cattggaaga gatgttgtcc 900 
ggccatacca gaccatgtcc aatcccatga gcaagctcac tgttctcaac agcatgcatt 960 
cccacttcat tctggctgac aacgggacca ctggaaaata tggagcagag gtgaaacttc 1020 
gaagacaact ggaaaagcat atttcactcc agaagataaa cacaagaatc ggtcaaggtg 1080 
ttcctgtggt ggcactcata gtggaaggag gacccaatgt gatctcgatt gttttggagt 1140 
accttcgaga cacccctccc gtgccagtgg ttgtctgtga tgggagtgga cgggcatcgg 1200 
acatcctggc ctttgggcat aaatactcag aagaaggcgg actgataaat gaatctttga 1260 
gggaccagct gttggtgact atacagaaga gtttcacata cactcgaacc caagctcagc 1320 
atctgttcat catcctcatg gagtgcatga agaagaagga attgattacg gtatttcgga 1380 
tgggatcaga aggacaccag gacattgatt tggctatcct gacagcttta ctcaaaggag 1440 
ccaatgcctc ggccccagac caactgagct tagctttagc ctggaacaga gtcgacatcg 1500 
ctcgcagcca gatctttatt tacgggcaac agtggccggt gggatctctg gagcaagcca 1560 
tgttggatgc cttagttctg gacagagtgg attttgtgaa attactcata gagaatggag 1620 
taagcatgca ccgttttctc accatctcca gactagagga attgtacaat acgagacatg 1680 
ggccctcaaa tacattgtac cacttggtca gggatgtcaa aaaggggaac ctgcccccag 1740 
actacagaat cagcctgatt gacatcggcc tggtgatcga gtacctgatg ggcggggctt 1800 
atcgctgcaa ctacacgcgc aagcgcttcc ggaccctcta ccacaacctc ttcggcccca 1860 
agaggcccaa agccttgaaa ctgctgggaa tggaggatga tattcccttg aggcgaggaa 1920 
gaaagacaac caagaaacgt gaagaagagg tggacattga cttggatgat cctgagatca 1980 
accacttccc cttccctttc catgagctca tggtgtgggc tgttctcatg aagcggcaga 2040 
agatggccct gttcttctgg cagcacggtg aggaggccat ggccaaggcc ctggtggcct 2100 
gcaagctctg caaagccatg gctcatgagg cctctgagaa cgacatggtt gacgacattt 2160 
cccaggagct gaatcacaat tccagagact ttggccagct ggctgtggag ctcctggacc 2220 
agtcctacaa gcaggacgaa cagctggcca tgaaactgct gacgtatgag ctgaagaact 2280 
ggagcaacgc cacgtgcctg cagcttgccg tggctgccaa acaccgcgac ttcatcgcgc 2340 
acacgtgcag ccagatgctg ctcaccgaca tgtggatggg ccggctccgc atgcgcaaga 2400 
actcaggcct caaggtaatt ctgggaattc tacttcctcc ttcaattctc agcttggagt 2460 
tcaagaacaa agacgacatg ccctatatgt ctcaggccca ggaaatccac ctccaagaga 2520 
aggaggcaga agaaccagag aagcccacaa aggaaaaaga ggaagaggac atggagctca 2580 
tagcaatgtt gggacgaaac aacggggagt cctccaggaa gaaggatgaa gaggaagttc 2640 
agagcgagca ccggttaatc cccctcggca gaaaaatcta tgaattctac aatgcaccca 2700
tcgtgaagtt ctggttctac acactggcgt atatcggata cctgatgctc ttcaactata 2760
tcgtgttagt gaagatggaa cgctggccgc ccacccagga atggatcgta atctcctata 2820
ttttcaccct gggaatagaa aagatgagag agattctgat gtcagagcca gggaagttgc 2880
tacagaaagt gaaggtatgg ctgcaggagc actggaatgt cacggacctc atcgccatcc 2940
ttctgttttc tgtcggaatg atccttcgtc tccaagacca gcccttcagg agtgacggga 3000
gggtcatcta ctgcgtgaac atcatttact ggtatatccg tctcctagac atcttcggcg 3060
tgaacaagta tttgggcccg tatgtaatga tgattggaaa aatgatgata gacatgatgt 3120
actttgtcat cattatgctg gtggttctga tgagctttgg ggtcgccagg caagccatcc 3180
tttttcccaa tgaggagcca tcatggaaac tggccaagaa catcttctac atgccctatt 3240
ggatgattta tggggaagtg tttgcggacc agatagaccc tccctgtgga cagaatgaga 3300
cccgagagga tggtaaaata atccagctgc ctccctgcaa gacaggagct tggatcgtgc 3360
cggccatcat ggcctgctac ctcttagtgg caaacatctt gctggtcaac ctcctcattg 3420
ctgtctttaa caatacattt tttgaagtaa aatcgatatc caaccaagtc tggaagtttc 3480
agaggtatca gctcatcatg actttccatg aaaggccagt tctgccccca ccactgatca 3540
tcttcagcca catgaccatg atattccagc acctgtgctg ccgatggagg aaacacgaga 3600
gcgacccgga tgaaagggac tacggcctga aactcttcat aaccgatgat gagctcaaga 3660
aagtacatga ctttgaagag caatgcatag aagaatactt cagagaaaag gatgatcggt 3720
tcaactcatc taatgatgag aggatacggg tgacttcaga aagggtggag aacatgtcta 3780
tgcggctgga ggaagtcaac gagagagagc actccatgaa ggcttcactc cagaccgtgg 3840
acatccggct ggcgcagctg gaagacctta tcgggcgcat ggccacggcc ctggagcgcc 3900
tgacaggtct ggagcgggcc gagtccaaca aaatccgctc gaggacctcg tcagactgca 3960
cggacgccgc ctacattgtc cgtcagagca gcttcaacag ccaggaaggg aacaccttca 4020
agctccaaga gagtatagac cctgcagagc atcctcttta ttcagtataa gccggcagca 4080
agcagttcta cctaaegtcc cacatocttc tcatgccaac acttctgtaa ttgateatta 4140
taaagaaaaa acaaggtaac agtcatagtt cacctgtctc ttatctattc acttctggtg 4200
ccacaactgt ttatcctttt ttgaagaaaa taagggaaca gaaatgcctt tttgtattgc 4260
aatcgaaatg aaagaagagt tgatgttaaa aaaacaaaag tcaagtgatt tattatatac 4320
agtgggcgtt caagtctagt cgagcaagct caggagaatg taattaaata attttatatt 4380
ttttaattta ttttgtatct cacctgtcat ggatgaattc attcactgaa tatgtaatat 4440
tgaaccttaa aaaaaaaa 4458

<210> 36
<211> 2722
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 4778139CB1
<220>
<221> unsure
<222> 2609
<223> a, t, c, g, or other
<400> 36

atggagatcg ctggagagaa gaaataccag tattatagca tgccctcttc tttgctgagg 60
agcctggctg gaggccggca tcccccaaga gtgcagaggg gggcacttct tgagattaca 120
aacagtaaaa gagaggctac aaatgtaaga aatgaccagg aaaggcaaga aacacaaagt 180
agtatagtgg tttctggggt gtctcctaac aggcaagcac actcaaagta tggccagttt 240
cttctggtcc cctctaatct aaagcgagtt cctttttcag cagaaactgt cttgcctctg 300
tccagaccct ctgtgccaga tgtgctggca actgaacagg acatccagac tgaggttctt 360
gttcatctga ctgggcagac Gccagttgtc tctgactggg catcagtgga tgaacccaag 420
gaaaagcacg agcctattgc tcacttactg gatggacaag acaaggcaga gcaagtgcta 480
cccactttga gttgcacacc tgaacccatg acaatgagct cccctctttc ccaagccaag 540
atcatgcaaa ctggaggtgg atatgtaaac tgggcatttt cagaaggtga tgaaactggt 600
gtgtttagca tcaagaaaaa gtggcaaacc tgcttgccct ccacttgtga cagtgattcc 660
tctcggagtg aacagcacca gaagcaggcc caggacagct ccctatctga taactcaaca 720
agatcggccc agagtagtga atgctcagag gtgggaccat ggcttcagcc aaacacatcc 780
ttttggatca atcctctccg cagatacagg cccttcgcta ggagtcatag ttttagattc 840
cataaggagg agaaattgat gaagatctgt aagattaaaa atctttcagg ctcttcagaa 900
atagggcagg gagcatgggt caaagcgaaa atgctaacca aagacaggag actgtcaaag 960
aaaaagaaga atactcaagg actccaggtg ccaatcataa cagtcaatgc ctgctctcag 1020
agtgaccagt tgaatccaga gccaggagaa aacagcatct ctgaagagga gtacagcaag 1080
aactggttca cagtgtccaa atttagtcac acaggtgtag aaccttacat acatcagaaa 1140
atgaaaacta aagaaattgg acaatgtgct atacaaatca gtgattacct aaagcagtct 1200
caagagtctg ctcaggatct cagcaaaaac tctttgtgga attccaggag caccaacctc 1260
aataggaact ccctgtcctc tctaatttca gagatctcag cctccttaaa aagccctcaa 1320
gagcctcacc atcattattc accttcactt ctctttgcag caggagaaga aataactgtc 1380
tacaggttgg aggagagttc ccctttaaac cttgataaaa gcatgtcctc ttggtctcag 1440
cgtgggagag cggcaatgat ccaggtattg tcccgagagg agatggatgg gggcctccgt 1500
aaagctatga gagtcgtcag cacttggtct gaggatgaca ttctcaagcc gggacaagtt 1560
ttcattgtca agtcctttct tcctgaggtt ?rtgcggacat ggcataaaat cttccaggag 1620
agcactgtgc ttcatctttg cctcagggaa attcaacaac aaagagctgc tcaaaaattg 1680
atctatacct tcaaccaagt gaaaccacaa accataccct acacaccaag gttcctggaa 1740
gttttcttaa tctactgcca ttcagccaac cagtggttga ccattgagaa gtatatgaca 1800
ggggagttcc ggaagtataa caacaacaat ggtgatgaaa tcacccccac caacaccctg 1860
gaggagctga tgttggcttt ctctcactgg acctatgagt acactcgggg agagctgctg 1920
gttttagatt tgcaaggtgt tggagaaaat ttgacagatc catctgttat aaaacctgaa 1980
gtcaaacaat caagaggaat ggtgtttgga ccggccaatt tgggggaaga tgcaatCaga 2040
aacttcattg caaaacatca ttggaactcc tgctgccgga agctcaaact cccggattta 2100
aaaagaaatg actattcccc tgaaaggata aattccacct ttggacttga gataaaaata 2160
gaatcagctg aggagcctcc agcaagggag acgggtagaa attccccaga agatgatatg 2220
caactataaa aagggaggag caagaagatc ccagtgcttg ccctgcctgc caggaactct 2280
gtgataacat agattgatca acgtgatgtt gattacatca gcgtctcctt gggacacgcc 2340
ttctgagcct cacatctcct tctgttcaaa ggcctcattg gtatatgatc aatgggttct 2400
cctagacact gacctctgtc cagggcactt tgcagctcca tcctcaagtt ccacacgaag 2460
atgcttggat gagtcagctg ggaatattgt tcttgtgtac ctcattgctt tagctggtca 2520
cttggaactt tggagcagaa tcctgcacat taaaggatgg ggttgggggg gatacattta 2580
ttttattttc tcactatgta tgcagactng accccctact actatttgtc acctcaccca 2640
cagattgtat ttatgtctat atatatgttt cataaaaagc ttaccaaacc caaaaaactaa 2700
aaaaaaaaaa aaaaaaaaaa aa 2722

<210> 37
<211> 1924
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 4787433CB1
<400> 37

cggcgcccag tgcacggccg caccatgggg tcccgccact tcgaggggat ttatgaccac 60
gtggggcact tcggcagatt ccagagagtc ctctatttca tatgtgcctt ccagaacatc 120
tcttgtggta ttcactactt ggcttctgtg ttcatgggag tcacccctca tcatgtctge 180
aggcccccag gcaatgtgag tcaggttgtt ttccataatc actctaattg gagtttggag 240
gacaccgggg ccctgttgtc ttcaggccag aaagattatg ttacggtgca gttgcagaat 300
ggtgagatct gggagctctc aaggtgtagc aggaataaga gggagaacac atcgagtttg 360
ggctatgaat acactggcag taagaaagag tttccttgtg tggatggcta catatatgac 420
cagaacacat ggaaaagcac tgcggtgacc cagtggaacc tggtctgtga ccgaaaatgg 480
cttgcaatgc tgatGcagcc cctatttatg tttggagtcc tactgggatc ggtgactttt 540
ggctactttt ctgacaggct aggacgccgg gtggtcttgt gggccacaag cagtagcatg 600
tttttgtttg gaatagcagc ggcgtttgca gttgattatt acacGttcat ggctgctcgc 660
ttttttcttg ccatggttgc aagtggctat cttgtggtgg ggtttgtcta tgtgatggaa 720
ttcattggca tgaagtctcg gacatgggcg tctgtccatt tgcattcctt ttttgcagtt 780
ggaaccctgc tggtggcttt gacaggatac ttggtcagga cctggtggct ttaccagatg 840
atcctctcca cagtgactgt cccctttatc ctgtgctgtt gggtgctccc agagacacct 900
ttttggcttc tctcagaggg acgatatgaa gaagcacaaa aaatagttga catcatggcc 960
aagtggaaca gggcaagctc ctgtaaactg tcagaacttt tatcactgga cctacaaggt 1020
cctgttagta atagccccac tgaagttcag aagcacaacc tatcatatct gttttataac 1080
tggagcatta cgaaaaggac acttaccgtt tggctaatct ggttcactgg aagtttggga 1140
ttctactcgt tttccttgaa ttctgttaac ttaggaggca atgaatactt aaacctcttc 1200
ctcctgggtg tagtggaaat tcccgcctac accttcgtgt gcatcgccat ggacaaggtc 1260
909aggagaa cagtcctggc ctactctctt ttctgcagtg cactggcctg tggtgtcgtt 1320
atggtgatcc cccagaaaca ttatattttg ggtgtggtga cagctatggt tggaaaattt 1380
gccatcgggg cagcatttgg cctcatttat ctttatacag ctgagctgta tccaaccatt 1440
gtaagatcgc tggctgtggg aagcggcagc atggtgtgtc gcctggccag catcctggcg 1500
ccgttctctg tggacctcag cagcatttgg atcttcatac cacagttgtt tgttgggact 1560
atggccctcc tgagtggagt gttaacacta aagcttccag aaacccttgg gaaacggcta 1620
gcaactactt gggaggaggc tgcaaaactg gagtcagaga atgaaagcaa gtcaagcaaa 1680
ttaettctca caactaataa tagtgggctg gaaaaaacgg aagcgattac ccccagggat 1740
tctggtcttg gtgaataaat gtgccatgcc tgctgtctag cacctgaaat attatttacc 1800
ctaatgcctt tgtattagag gaatcttatt ctcatctccc atatgttgtt tgtatgtctt 1860
tttaataaat tttgtaagaa aattttaaag caaatatgtt ataaaagaaa taaaaactaa 1920
gatg 1924



<210> 38 
<211> 1797 
<212> DNA 
<213> Homo sapiens 



<220>

<221> misc_feature

<223> Incyte ID No: 7483598CB1 



<400> 38 

agcatcGCcg tcccggagga aaaaacattt gtctggcgaa ctccgggtgg aaagcgcccc 60
aggctgccac agcctagaga tcttggggct tcagcccctc gcggcctgcc gagggagcag 120
ggggcgcccg tggaactggc tccctgcagc tctgcggcta cacgcggacc tcggctgtgt 180
gcgaggtggc ggaggaggct ggccgggtgc gaatccgtac ccagccccag catcttccac 240
ctgctgagga ccaccgctca gccatgggct accagaggca ggagcctgtc atcccgccgc 300
agagagattt agatgacaga gaaacccttg tttctgaaca tgagtataaa gagaaaacct 360
gtcagtctgc tgctcttttt aatgttgtca actcgattat aggatctggt ataataggat 420
tgccttattc aatgaagcaa gctgggtttc ctttgggaat attgctttta ttctgggttt 480
catatgttac agacttttcc cttgttttat tgataaaagg aggggccctc tctggaacag 540
atacctacca gtctttggtc aataaaactt tcggctttcc agggtatctg ctcctctctg 600
ttcttcagtt tttgtatcct tttatagcaa tgataagtta caatataata gctggagata 660
ctttgagcaa agtttttcaa agaatcccag gagttgatcc tgaaaacgtg tttattggtc 720
gccacttcat tattggactt tccacagtta cctttactct gcctttatcc ttgtaccgaa 780
atatagcaaa gcttggaaag gtctccctca tctctacagg tttaacaact ctgattcttg 840
gaattgtaat ggcaagggca atttcactgg gtccacacat accaaaaaca gaagacgctt 900
gggtatttgc aaagcccaat gccattcaag cggtcggggt tatgtctttt gcatttattt 960
gccaccataa ctccttctta gtttacagtt ctctagaaga acccacagta gctaagtggt 1020
cccgccttat ccatatgtcc atcgtgattt ctgtatttat ctgtatattc tttgctacat 1080
gtggatactt gacatttact ggcttcaccc d^ggggactt atttgaaaat tactgcagaa 1140
atgatgacct ggtaacattt ggaagatttt gttatggtgt cactgtcatt ttgacatacc 1200
ctatggaatg ctttgtgaca agagaggtaa ttgccaatgt gttttttggt gggaatcttt 1260
catcggtttt ccacattgtt gtaacagtga tggtcatcac tgtagccacg cttgtgtcat 1320
tgctgattga ttgcctcggg atagttctag aactcaatgg tgtgctctgt gcaactcccc 1380
tcatttttat cattccatca gcctgttatc tgaaactgtc tgaagaacca aggacacact 1440
ccgataagat tatgtcttgt gtcatgcttc ccattggtgc tgtggtgatg gtttttggat 1500
tcgtcatggc tattacaaat actcaagact gcacccatgg gcaggaaatg ttctactgct 1560
ttcctgacaa tttctctctc acaaatacct cagagtctca tgttcagcag acaacacaac 1620
tttctacttt aaatattagt atctttoaat gagttgactg ctttaaaaat atgtatgttt 1680
tcatagactt taaaacacat aacatttacg cttgctttag tctgtattta tgttatataa 1740
aattattatt ttggctttta tcaagacttg gcttttatga gtagtgcaat ataaaaa 1797



<210> 39 

<211> 3277 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7484d23CB1



<400> 39 
atgggggaag gagatagtgg gttgaggagc ccaagaggcc cacaggaagg tgggaaatgc 60
agtagggaaa aacagaagcg caatatggag gaactgaaga aggaagtggt catggatgat 120 
cacaaattaa ccttggaaga gctgagcacc aagtactccg tggacctgac aaagggccat 180 
agccaccaaa gggcaaagga aatcctgact cgaggtggac ccaatactgt taccccaccc 240 
cccaccactc cagaatgggt caaattctgt aagcaactgt tcggaggctt ctccctccta 300 
ctatggactg gggccattct ctgctttgtg gcctacagca tccagatata tttcaatgag 360 
gagcctacca aagacaacct ctacctgagc atcgtactgt ccgtcgtggt catcgtcact 420 
ggctgcttct cctattatca ggaggccaag agctccaaga tcatggagtc ttttaagaac 480 
atggtgcctc agcaagctct ggtaattcga ggaggagaga agatgcaaat taatgtacaa 540 
gaggtggtgt tgggagacct ggtggaaatc aagggtggag accgagtccc tgctgacctc 600 
cggcttatct ctgcacaagg atgtaaggtg gacaactcat ccttgactgg ggagtcagaa 660 
ccccagagcc gctcccctga cctcacccat gagaaccctc tggagacccg aaacatctgc 720 
ttcttttcca ccaactgtgt ggaaggaacc gcccggggta ttgtgattgc tacgggagac 780 
tccacagtga tgggcagaat tgcctccctg acgtcaggcc tggcggttgg ccagacacct 840 
atcgctgctg agatcgaaca cttcatccat ctgatcactg tggtggccgt cttccttggt 900 
gtcacttttt ttgcgctctc acttctcttg ggctatggtt ggctggaggc tatcattttt 960 
ctcattggca tcattgtggc caatgtgcct gaggggctgt tggccacagt cactgtgtgc 1020 
ctgaccctca cagccaagcg catggcgcgg aagaactgcc tggtgaagaa cctggaggcg 1080 
gtggagacgc tgggctccac gtccaccatc tgctcagaca agacgggcac cctcacccag 1140 
aaccgcatga ccgtcgccca catgtggttt gatatgaccg tgtatgaggc cgacaccact 1200 
gaagaacaga ctggaaaaac acttaccaag agctctgata cctggtttat gctggcccga 1260 
atcgctggcc tctgcaaccg ggctgacttt aaggctaatc aggagatcct gcccattgct 1320 
aagagggcca caacaggtga tgcttccgag tcagccctcc tcaagttcat cgageagtct 1380 
tacagctctg tggcggagat gagagagaaa aaccccaagg tggcagaggt tccctttaat 1440 
tctaccaaca agtaccagat gtccatccac cttcgggagg acagctccca gacccacgta 1500 
ctgatgatga agggtgctcc ggagaggatc ttggagtttt gttctacctt tcttctgaat 1560 
gggcaggagt actcaatgaa cgatgaaatg aaggaagcct tccaaaatgc ctatttagaa 1620 
ctgggaggtc tgggggaacg tgtgctaggc ttctgcttct tgaatctgcc tagcagcttc 1680 
tccaagggat tcccatttaa tacagatgaa ataaatttcc ccatggacaa cetttgtttt 1740 
gtgggcctca tatccatgat tgaccctccc cgagctgcag tgcctgatgc tgtgagcaag 1800 
tgtcgcagtg caggaattaa ggtgatcatg gtaacaggag atcatcccat tacagctaag 1860 
gccattgcca agggtgtggg catcatctca gaaggcactg agacggcaga ggaagtcgct 1920 
gcccggctta agatccctat cagcaaggtc gatgccagtg ctgccaaagc cattgtggtg 1980 
catggtgcag aactgaagga catacagtcc aagcagcttg atcagatcct ccagaaccac 2040 
cctgagatcg tgtttgctcg gacctcccct cagcagaagc tcatcattgt cgagggatgt 2100 
cagaggctgg gagccgttgt ggccgtgaca ggtgacgggg tgaacgactc ccctgcgctg 2160 
aagaaggctg acattggcat tgccatgggc atctctggct ctgacgtctc taagcaggca 2220 
gccgacatga tcctgctgga tgacaacttt gcctccatcg tcacgggggt ggaggagggc 2280 
cgcctgatct ttgacaacct gaagaaatcc atcatgtaca ccctgaccag caacatcccc 2340 
gagatcacgc ccttcctgat gttcatcatc ctcggtatac ccctgcctct gggaaccata 2400 
accatcctct gcattgatct cggcactgac atggtccctg ccatctcctt ggcttatgag 2460 
tcagctgaaa gcgacatcat gaagaggctt ccaaggaacc caaagacgga taatctggtg 2520 
aaccaccgtc tcattggcat ggcctatgga cagattggga tgatccaggc tctggctgga 2580 
ttctttacct actttgtaat cctggctgag aatggtttta ggcctgttga tctgctgggc 2640 
atccgcctcc actgggaaga taaatacttg aatgacctgg aggacagcta cggacagcag 2700 
tggacctatg agcaacgaaa agttgtggag ttcacatgcc aaacggcctt ttttgtcacc 2760 
atcgtggttg tgcagtgggc ggatctcatc acctccaaga ctcgccgcaa ctcacttttc 2820 
cagcagggca tgagaaacaa agtcttaata tttgggatcc tggaggagac actcttggct 2880 
gcatttctgt cctacactcc aggcatggac gtggccctgc gaatgtaccc actcaagata 2940 
acctggtggc tctgtgccat tccctacagt attctcatct tcgtctatga tgaaatcaga 3000 
aaactcctca tccgtcagca cccggatggc tgggtggaaa gggagacgta ctactaaact 3060 
cagcagatga agagcttcat gtgacacagg ggtgttgtga gagctgggat ggggccagag 3120 
attataagtt tgacacaaca tctgagacac taggatgaat tatcttggat gagaaagatg 3180 
ggcaatcctg ggctggcttg agggaatcat gggcagagga tgaggtgggc tgaagggaag 3240 
cccagcctgc atctagctgg agccccgcag ggagggg 3277 

<210> 40 
<211> 2773 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature
<223> Incyte ID No: 143935CB1
<400> 40 

atggaagaga tgtcaggaga aagtgtggtg agctcagcgg tgccagcggc tgctacccgc 60 
accacttcct tcaagggcac gagccccagc tccaaatacg tgaagctgaa tgtgggtgga 120 
gccctccact ataccaccat gcagacgctg accaagcagg acaccatgct gaaggccatg 180 
ttcagcgggc gcatggaagt gctcaccgac agtgaaggct ggatcctcat tgaccgctgt 240 
gggaagcact ttggtacgat actcaactac cttcgagacg gggcggtgcc tttacccgag 300 
agccgccggg agatcgagga gctgctagca gaagccaagt actacctagt ccaaggcctg 360 
gtggaagagt gccaggcggc cctacaaaac aaagatactt atgagccttt ctgcaaggtc 420 
cctgtgatca cctcatccaa ggaagaacaa aaacttatag cgacttcaaa taagccagcc 480 
gtgaagttgc tctacaacag aagtaacaac aaatactcat ataccagcaa ttctgacgac 540 
aatatgttga aaaacattga actgtttgat aagctgtctc tgcgctttaa cggaagggtc 600 
ctgttcataa aggatgttat tggggatgaa atctgctgct ggtcctttta tggtcagggc 660 
cggaagattg ctgaagtctg ttgtacctcc atcgtctatg ccactgagaa gaaacagacc 720 
aaggtggagt ttcccgaagc ccggatttat gaggagaccc tgaacatttt gctgtatgag 780 
gcccaggatg gccggggacc tgacaatgcg ctcctggagg ccacaggcgg ggcggcgggg 840 
cgctcccacc acctggacga ggacgaggag cgggagcgga tcgagcgcgt gcggaggatc 900 
cacatcaagc gccctgatga ccgggcccac ctccaccagt gagcaggcaa gagaccgagc 960 
cgccctcctc tcaccgcccc cactccctgc cgtgctacac ccagatcctg tgcaggctgc 1020 
cgggcccctt ctgcttccct tggagcctgg agatactttt gtaacaagcc agatgattat 1080 
tttggtattg cttgacaagg caaattgatt gtcttgaccc aggcgtatiga cccctgtcgt 1140 
tgaacaagct gtgtctaaga tctctacttt tcatgagaat ctgagactct ttggagccag 1200 
gctttctcgg ttctcagagg aaaagtatga atgagtgtga agtgtatgtg agaacttttg 1260 
tttgcaatat ttatttttgt gggtgtcgac ttcctatgtg ggctttttgg gtgacactcc 1320 
cttaagggtt cagtttgaca attctgagag ttgtcctgca gttggaggcc accagaggta 1380 
tctgagctcc ctgcttccta tttcataatc ctccagcccc agcaggtcca ctcctggttc 1440 
ctgtgtgttt ggcccgggca caatccccac tgctttgcta gacgtgcttt ctgccatgtg 1500 
gctttgggcc tagagcttgt tgataattgc agcttgtggc agtggaaata tggctgaatg 1560 
agcgtctaaa tcgttgagac cagtgcaact ttgggtgcaa ggctttgttt agggatcaag 1620 
ccttttgcca ccttgggctg gtctttggcc tggtgctcac tgggacccca tatgtctgcg 1680 
taggagcaga actttccatg gcagtaagtg tccagctctg tttctggttc tttccceaac 1740 
tccagccccg tccagttgtt ctcctgattg acccgactcc actccaggaa ggccatctga 1800 
ccctgtgaca ggcatagctc ataaactacc cctccctggg atcccgctcc tcttcagcct 1860 
ccttccccat gaagctgggc taactttcta agtcattttg cttagaaatt cagtgtggcc 1920 
catacccttt gtcctcccag cctggcatcc aggcagggac accctcacac caccagcccc 1980 
agggagcttc cctgctataa acacagaccc ccttgtcttt gcctctgatt tttacacagt 2040 
gtagagtggc cagcagtgaa caggttgagg atgtgcgggt agatagataa ctttgggtct 2100 
ggtttgtgtc tgtgttcatg tttgtttaag ggatatgtgt gactgtgggt ggggacgtgt 2160 
gcttgtgggg cacaggtggc ggcccctgct ggagcccggc tgggcgcagc gcctatgtag 2220 
gacgggtgtt ctcagtgacc tacctcccag gctcctctgc acctgcaaag gaacaggagt 2280 
gagtcgtgac tgacaggggt ggttgagact agactaggta gagtagttac caggagatgt 2340 
gaatgtgcgt caggtgatgg atgggtttgt caagggaatc gttaccgttt tataccaaag 2400 
gtattaacat gggcagcctt tgacacatgt attccaaaaa cgagtttata ttttcaaacg 2460 
gtttttacag cttagacttt gtacttactg ccctgcctgt gacagttgta tgccttcatt 2520 
ttgtatccaa cagcaaagtc cacaataaaa ctttaaaaca atcatgactg aatgtcaaaa 2580 
tcgtgtattg ggcagatgct ttttaaactg tcgtgtgaga aacttttata ttaggccatt 2640 
tggattttat taagtgctaa ggaaagaggg cttacaaaat gtttcgtaaa tattttatac 2700 
tgtttaagtg ttaaacacca accctgtctt tcttttgggt tgagcttttt tagaaagtcg 2760 
aagtgaatgt tgg 2773 

<210> 41 
<211> 2966 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5923789CB1 

<400> 41 

tttttccaag tgagagactg ccccatatgt tagtgcaata tgtcactgga ggtgaagcat 60 
cagttgtatt ggtgggaacc tgccgtttgc tgtccecttt ttcctcatgc cttttcctgc 120 
ctctctgatc ttttctaggt ctctggccta tcaggaggac aactggtgct gcaatagaag 180
ccagtggcta agtctcgtgt atggcgtggt taaggttgca gcctctcacc tctgccttcc 240 
tccattttgg gctggttacc tttgtgctct tcctgaatgg tcttcgagca gaggctggtg 300 
gctcagggga cgtgccaagc acagggcaga acaatgagtc ctgttcaggg tcatcggact 360 
gcaaggaggg tgtcatcctg ccaatctggt acccggagaa cccttcccct ggggacaaga 420 
ttgccagggt cattgtctat tttgtggccc tgatatacat gttccttggg gtgtccatca 480 
ttgctgaccg cttcatggca tctattgaag tcatcacctc tcaagagagg gaggtgacaa 540 
ttaagaaacc caatggagaa accagcacaa ccactattcg ggtctggaat gaaactgtct 600 
ccaacctgac ccttatggcc ctgggttcct ctgctcctga gatactcctc tctttaattg 660 
aggtgtgtgg tcatgggttc attgctggtg atctgggacc ttctaccatt gtagggagtg 720 
cagccttcaa catgttcatc atcattggca tctgtgtcta cgtgatccca gacggagaga 780 
ctcgcaagat caagcatcta cgagtcttct tcatcaccgc tgcttggagt atctttgcct 840 
acatctggct ctatatgatt ctggcagtct tctcccctgg tgtggtccag gtttgggaag 900 
gcqtcctcac tctcttcttc tttccagtgt gtgtccttct ggcctgggtg gcagataaac 960 
gactgctctt ctacaaatac atgcacaaaa agtaccgcac agacaaacac cgaggaatta 1020 
tcatagagac agagggtgac caccctaagg gcattgagat ggatgggaaa atgatgaatt 1080 
cccattttct agatgggaac ctggtgcccc tggaagggaa ggaagtggat gagtcccgca 1140 
gagagatgat ccggattctc aaggatctga agcaaaaaca cccagagaag gacttagatc 1200 
agctggtgga gatggccaat tacCatgctc tttcccacca acagaagagc cgtgccttct 1260 
accgtatcca agccactcgt atgatgactg gtgcaggcaa tatcctgaag aaacatgcag 1320 
cagaacaagc caagaaggcc tccagcatga gcgaggtgca caccgatgag cctgaggact 1380 
ttatttccaa ggtcttcttt gacccatgtt cttaccagtg cctggagaac tgtggggctg 1440 
tactcctgac agtggtgagg aaagggggag acatgtcaaa gaccatgtat gtggactaca 1500 
aaacagagga tggttctgcc aatgcagggg ctgactatga gttcacagag ggcacggtgg 1560 
ttctgaagcc aggagagacc cagaaggagt tctccgtggg cataattgat gacgacattt 1620 
ttgaggagga tgaacacttc tttgtaaggt tgagcaatgt ccgcatagag gaggagcagc 1680 
cagaggaggg gatgcctcca gcaatattca acagtcttcc cttgcctcgg gctgtcctag 1740 
cctccccttg tgtggccaca gttaccatct tggatgatga ccatgcaggc atcttcactt 1800 
ttgaatgtga tactattcat gtcagtgaga gtattggtgt tatggaggtc aaggttctgc 1860 
ggacatcagg tgcccggggt acagtcatcg tcccctttag gacagtagaa gggacagcca 1920 
agggtggcgg tgaggacttt gaagacacat atggggagtt ggaattcaag aatgatgaaa 1980 
ctgtgaaaac cataagggtt aaaatagtag atgaggagga atacgaaagg caagagaatt 2040 
tcttcattgc ccttggtgaa ccgaaatgga tggaacgtgg aatatcagat gtgacagaca 2100 
ggaagctgac tatggaagaa gaggaggcca agaggatagc agagatggga aagccagtat 2160 
tgggtgaaca ccccaaacta gaagtcatca ttgaagagtc ctatgagttc aagactacgg 2220 
tggacaaact gatcaagaag acaaacctgg ccttggttgt ggggacccat tcctggaggg 2280 
accagttcat ggaggccatc accgtcagtg cagcagggga tgaggatgag gatgaatccg 2340 
gggaggagag gctgccctcc tgctttgact acgtcatgca cttcctgact gtcttctgga 2400 
aggtgctgtt tgcctgtgtg ccccccacag agtactgcca cggctgggcc tgcttcgccg 2460 
tctccatcct catcattggc atgctcaccg ccatcattgg ggacctggcc tcgcacttcg 2520 
gctgcaccat tggtctcaaa gattcagtca cagctgttgt tttcgtggca tttggcacct 2580 
ctgtcccaga tacgtttgcc agcaaagctg ctgccctcca ggatgtatat gcagacgcct 2640 
ccattggcaa cgtgacgggc agcaacgccg tcaatgtctt cctgggcatc ggcctggcct 2700 
ggtccgtggc cgccatctac tgggctctgc agggacagga gttccacgtg tcggccggca 2760 
cactggcctt ctccgtcacc ctcttcacca tctttgcatt tgtctgcatc agcgtgctct 2820 
tgtaccgaag gcggccgcac ctgggagggg agcttggtgg cccccgtggc tgcaagctcg 2880 
ccacaacatg gctctttgtg agcctgtggc tcctctacat actctttgcc acactagagg 2940 
cctattgcta catcaagggg ttctaa 2966 

<210> 42 
<211> 4404 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 6046484CB1 

<400> 42 

atgtatatcc gtgtatccta tgacaccaag ccagactcac tgctccatct catggtgaaa 60 
gattggcagc tggaactccc caagctctta atatctgtgc atggaggcct ccagaacttt 120 
gagacgcagc ccaagctgaa acaagtcttt gggaaaggcc tgatcaaggc tgctatgacc 180 
accggggcct ggatcttcac cgggggtgtc agcacaggtg ttatcagcca cgtaggggat 240 
gccttgaaag accactcctc caagtccaga ggccgggttt gtgctatagg aattgctcca 300
tggggcatcg tggagaataa ggaagacctg gttggaaagg atgtaacaag agtgtaccag 360
accatgtcca accctctaag taagctctct gtgctcaaca actcccacac ccacttcatc 420
ctggctgaca atggcaccct gggcaagtat ggcgccgagg tgaaacttcg aagacaactg 480
gaaaagcata tttcactcca gaagataaac acaagaatcg gtcaaggtgt tcctgtggtg 540
gcactcatag tggaaggagg acccaatgtg atctcgattg ttttggagta ccttcgagac 600
acccctcccg tgccagtggt tgtctgtgat gggagtggac gggcatcgga catcctggcc 660
tttgggcata aatactcaga agaaggcgga ctgataaatg aatctttgag ggaccagctg 720
ttggtgacta tacagaagac tttcacatac actcgaaccc aagctcagca tctgttcatc 780
atcctcatgg agtgcatgaa gaagaaggaa ttgattacgg tatttcggat gggatcagaa 840
ggacaccagg acattgattt ggctatcctg acagctttac tcaaaggagc caatgcctcg 900
gccccagacc aactgagctt agctttagcc tggaacagag tcgacatcgc tcgcagccag 960
atctttattt acgggcaaca gtggccggtg ggatctctgg agcaagccat gttggatgcc 1020
ttagttctgg acagagtgga ttttgtgaaa ttactcatag agaatggagt aagcatgcac 1080
cgttttctca ccatctccag actagaggaa ttgtacaata cgagacatgg gccctcaaat 1140
acattgtacc acttggtcag ggatgtcaaa aaggggaacc tgcccccaga ctacagaatc 1200
agcctgattg acatcggcct ggtgatcgag tacctgatgg gcggggctta tcgctgcaac 1260
tacacgcgca agcgcttccg gaccctctac cacaacctct tcggccccaa gagggatgat 1320
attcccttga ggcgaggaag aaagacaacc aagaaacgtg aagaagaggt ggacattgac 1380
ttggatgatc ctgagatcaa ccacttcccc ttccctttcc atgagctcat ggtgtgggct 1440
gttctcatga agcggcagaa gatggccctg ttcttctggc agcacggtga ggaggccatg 1500
gccaaggccc tggtggcctg caagctctgc aaagccatgg ctcatgaggc ctctgagaac 1560
gacatggttg acgacatttc ccaggagctg aatcacaatt ccagagactt tggccagctg 1620
gctgtggagc tcctggacca gtcctacaag caggacgaac agctggccat gaaactgctg 1680
acgtatgagc tgaagaactg gagcaacgcc acgtgcctgc agcttgccgt ggctgccaaa 1740
caccgcgact tcatcgcgca cacgtgcagc cagatgctgc tcaccgacat gtggatgggc 1800
cggctccgca tgcgcaagaa ctcaggcGtc aaggtaattc tgggaattct tctacccccc 1860
accatcttgt ttttggaatt tcgcacatat gatgatttct cgtatcaaac atccaaggaa 1920
aatgaggatg gcaaagaaaa agaagaggaa aatacggatg caaatgcaga tgctggctca 1980
agaaaggggg atgaggagaa cgagcacaaa aaacagagaa gtattcccat cggaacaaag 2040
atctgtgaat tctataacgc gcccattgtc aagttctggt tctttcagat atcatacttg 2100
ggctacctgc tgctgtttaa cCacgtcatc ctggtgcgga tggatggctg gccgtccctc 2160
caggagtgga tcgtcatctc ctacatcgtg agcctggcgt tagagaagat acgagaggtg 2220
gctactccaa aggcctcccc cagcccacta gccagaaaat caaagtttgg cttccaggag 2280
tactggaaca tcacagatct cgtggccatt tccacattca tgattggagc aattcttcgc 2340
ctacagaacc agccctacat gggctatggc cgggtgatct actgtgtgga tatcatcttc 2400
tggtacatcc gtgtcctgga catctttggt gtcaacaagt atctggggcc atacgtgatg 2460
atgattggaa agatggttag tagtggcatc ctgtgggtag tcatcatgct ggCcgtgctc 2520
atgagtttcg gagtagcccg tcaagccatt ctgcatccag aggagaagcc ctcttggaaa 2580
ctggcccgaa acatcttcta catgccctac tggatgatct atggagaggt gtttgcagac 2640
cagatagacc gtaagagttt ttttctttca gctccttgtg gtgagaacct atatgatgag 2700
gagggcaagc ggcttcctcc ctgtatcccc ggcgcctggc tcactccagc actcatggcg 2760
tgctatctac tggtcgccaa catcctgctg gtgaacctgc tgattgctgt gttcagcaat 2820
accttctttg aagtaaaatc aatatccaac caggtgtgga agttccagcg atatcagctg 2880
attatgacat ttcatgacag gccagtcctg cccccaccga tgatcatttt aagccacatc 2940
tacatcatca ttatgcgtct cagcggccgc tgcaggaaaa agagagaagg ggaccaagag 3000
gaacgggatc gtggattgag tatgttcctt agcgacgagg agctaaagag gctgcatgag 3060
ttcgaggagc agtgcgtgca ggagcacttc cgggagaagg aggatgagca gcagtcgtcc 3120
agcgacgagc gcatccgggt cacttctgaa agggttgaaa atatgtcaat gaggttggaa 3180
gaaatcaatg aaagagaaac ttttatgaaa acttccctgc agactgttga ccttcgactt 3240
gctcagctag aagaattatc taacagaatg gtgaatgctc ttgaaaatct tgcgggaatc 3300
gacaggtctg acctgatcca ggcacggtcc cgggcttctt ctgaatgtga ggcaacgtat 3360
cttctccggc aaagcagcat caatagcgct gatggctaca gcttgtatcg atatcatttt 3420
aacggagaag agttattaCt tgaggataca tctctctcca cgtcaccagg gacaggagtc 3480
aggaaaaaaa cctgttcctt ccgtataaag gaagagaagg acgtgaaaac gcacctagtc 3540
ccagaatgtc agaacagtct tcacctttca ctgggcacaa gcacatcagc aaccccagat 3600
ggcagtcacc ttgcagtaga tgacttaaag aacgctgaag agtcaaaatt aggtccagat 3660
attgggattt caaaggaaga tgatgaaaga cagacagact ctaaaaaaga agaaactatt 3720
tccccaagtt taaataaaac agatgtgata catggacagg acaaatcaga tgttcaaaac 3780
actcagetaa cagtggaaac gacaaatata gaaggcacta tttcctatcc cctggaagaa 3840
accaaaatta cacgctattt ccccgatgaa acgatcaatg cttgtaaaac aatgaagtcc 3900
agaagcttcg tctattcccg gggaagaaag ctggtcggtg gggttaacca ggatgtagag 3960
tacagttcaa tcacggacca gcaattgacg acggaatggc aatgccaagt tcaaaagatc 4020
acgcgctctc atagcacaga tattccttac attgtgtcgg aagctgcagt gcaagctgag 4080
cataaagagc agtttgcaga tatgcaagat gaacaccatg tcgctgaagc aattcctcga 4140
atccctcgct tgtccctaac cattactgac agaaatggga tggaaaactt actgtctgtg 4200
aagccagatc aaactttggg attcccatct ctcaggtcaa aaagtttaca tggacatcct 4260
aggaatgtga aatccattca gggaaagtta gacagatctg gacatgccag tagtgtaagc 4320
agcttagtaa ttgtgtctgg aatgacagca gaagaaaaaa aggttaagaa agagaaagct 4380
tccacagaaa ctgaatgcta gtct 4404



<210> 43 

<211> 669 

<212> DNA

<213> Homo sapiens



<220> 

<221> misc_feature

<223> Incyte ID No: 7481427CB1



<400> 43 

atgacggaac aagccatctc cttcgccaag gacttcctag ctggaggcat caccgccgct 60
atctccaaga cagccgtggc atcgatcaag cgggtccagc tgctgctgca gatgcagcac 120 
gccagcatgc cgatggccgc cgccaagcag tgcaagggca tcgtggactg catcgtccgc 180 
atccccaaag atcagggcgt gctgtccttc tggaggggca acctggccaa tgtcatccgc 240 
tactccccca cgcaagccct caatttcgcc ttcaaggata agtacaagca gatcttcctg 300 
gcgggcgtgg acaagcacac gcagttctgc aggtactttg cgggcaacct ggcctctggc 360 
ggcacggccg tcgtgtatcc cctggatttc accagaaccc gcctggcagc cgacgtggga 420 
aagtcaggca cggagcgcga gttccgaggc ctgggagact gcctagtgaa gatcagcaag 480 
tccgacggca tccggggcct ttaccagggc ttcagcgtct ccgtgcaggc catcatcatc 540 
taccaggcag cctacttcag ggtgtacgat acggccaatg gcatgttccc cgaccccaag 600 
aacacacaca tcctggtgag ctggatgacc gcgcagaccg tgacggccgt ggctggcgtg 660 
ctctcctaa 669 

<210> 44 

<211> 1823 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7483595CB1 

<400> 44 

aaaaaaaaac aaataaaaaa tgcccagtgt gagcatctgt agtcccagct atttgggaag 60 
ttgaagcaga gaatcccttg tgcccaggag tttgaggtta cagtgagcta tgattgcacc 120 
actgcactcc ggcctgggca acagagtgag atcctgtctc ataaataaat aaataggcaa 180 
aggcttgttc tgatcacctt gtccctgttt tccccaagtc aacctagaga gcagtagatt 240 
ctagggcaag aacttaagtg agctgactgc tcacaaagaa ggaaaggagt agtgtgagct 300 
agcacttgtt gggagcctct gatcaacaat acaaggaccc gtgttatcaa cttctgttca 360 
ttagggaaag agagttacaa gagttaccaa aggggcctat cttaactttt ctatggaaac 420 
ccttcttttc ctggagagtg ccattggtgc cattgctgga ttgaagacat tctaccccac 480 
tctggttttc tcttctctgc agctaataat gggcgtattg ggtttgggct tcattgccac 540 
ttaccttccg gagtctgcaa tgagtgctta cctggctgct gtggcacttc atatcatgct 600 
gtcccagctg actttcatct ttgggattat gattagtttc catgccggtc ccatctcctt 660 
cttctatgac ataattaatt actgtgtagc tctcccaaaa gcgaattcca ccagcattct 720 
agtatttcta actgttgttg ttgctctgcg aatcaacaaa tgtatcagaa tttctttcaa 780 
tcagtatccc attgagtttc ccatggaatt atttctgata attttacaag ccttctcctt 840 
atctttggtg agctcctttc tgctcatatt tctgggcaag aagattgcca gtcttcacaa 900 
ttacagtgtc aattccaacc aggatttaat agccatcggc ctttgcaatg tcgtcagttc 960 
atttttcaga tcttgtgtgt ttactggtgc tattgctagg actattatcc aggataaatc 1020 
tggaggaagc acaacagttt gcatctctgg taggcgcaga gctaagattc ttctcctggg 1080 
tcaaatccct aacaccaaca tttatagaag catcaatgat tatcgggaga tcatcaccat 1140 
tcctggggtg aaaatcttcc agtgctgcag ctcaattaca tttgtaaatg tttactacct 1200 
aaagcataag ctgttaaaag aggttgatat ggtaaaggtg cctcttaaag aagaagaaat 1260 
tttcagcttg tttaattcaa gtgacaccaa tctacaagga ggaaagattt gcaggtgttt 1320 
ctgcaactgt gatgatctgg agccgctgcc
taaactggat cccgaagcat cctccgttaa 
gaacacaagc caaactgcat ccgaagacca 
gaaaaatcaa gggcaacagt atgaggaggt 
aagaaacagc tcaccaggac tgcctgatgt 
cccttactca gatgcgtctc tactgcccag 
ggtacactac gtggattcaq gggggttagt 
cttggcagga gctctgatcc ccctcctacc 
tctgttagtc actgatcagc tgc 

<210> 45 
<211> 2931 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<223> Incyte ID No: 3788427CB1

<400> 45 

atgtccatag ttcgtttatc cgtccatgcc aaatggatta tggggaaggt gactgggaca 60 

aaaatgcaaa agactgctaa agtgagagtg atcaggcttg ttctggatcc ccatttatta 120 

aagtattata ataagcaaaa aacgtacttt gctcacaatg cccttcagca gtgcacaatt 180 

ggggatatcg tgcttctcaa agctttacct gttccacgaa caaagcatgt gaaacatgaa 240 

ctggctgaga ttgttttcaa agttggaaaa ctcgtagatc ctgtgacagg aaaacctcgt 300 

gcaggaacta cctatctgga gagtccattg agttcggaaa ccacccaggg cgtagacggg 360 

gcttcccgtc cctcacgcgg gcccgcccca tgccgggcgg ggcctggggc ccgccgcctc 420 

cgcccgtggc cggagagccc caggcccgag ccacgcggtc ttccgggccc gggtcggggc 480 

tcaatggcta cgtggaggcg ggacggccga ctgacaggcg gccaaaggct gctgtgcgct 540 

gggctggcgg ggacgctcag cctcagcctc accgcgcccc tggagctcgc caccgtgctg 600 

gcccaggttg gcgtcgtgcg aggccacgcc cggggaccgt gggccacagg gcaccgggtg 660 

tggcgggcag aggggctccg ggccctgtgg aaggggaacg cggtggcgtg cctgcgcctc 720 

ttcccctgca gcgccgtgca gctcgccgcc taccgcaaat ttgttgtgct gttcacagat 780 

gacctgggcc acatttccca gtggagctcc atcatggctg ggagtctcgc aggcatggtt 840 

tccaccattg taacatatcc tacagacctc atcaaaaccc ggttgatcat gcagaacata 900 

ctggaaccat cgtacagggg gctcctccat gctttttcta ctatttacca acaggaaggg 960 

ttccttgccc tttatcgagg ggtttccctc actgttgtag gtgctctccc gttctctgct 1020 

ggctcccttc ttgtttacat gaacctggag aaaatctgga acggaccccg agatcagttc 1080 

tctctcccac agaactttgc taatgtctgt ctggctgctg cagtgaccca gaccctctcc 1140 

tttccctttg agaccgtgaa gagaaagatg caggctcaga gcccctacct cccacacagt 1200 

ggaggagtag atgtccattt ctcaggagca gtggactgct tccggcagat agtgaaggcc 1260 

cagggggtcc tggggctctg gaatggattg acagccaatt tactgaagat agttccatat 1320 

tttggaatta tgtttagcac ctttgagttc tgcaagagaa tctgtcttta tcaaaatggt 1380 

tacattctgt ctccactgag ctataaattg accccaggag tcgatcagag tttgcagccc 1440 

caggaattac gagaattaaa gaagttcttc aaaacgagaa aactgaagcc taaaaaacca 1500 

acuctataaa atggaatgga actagaagtg cactgactga cagcatgttg caatcacaga 1560 

taaatgtagc ctcataactg tgcacctgag gagggatgag aagctactgc tatctgctgc 1620 

tctgggaaat gagatggttt ggccatgcac cacctcagat ctccaaactg atgtccctgt 1680 

gaacacctgc actatgagaa ggtttcccaa caagagtgta tcatgatggc attccacgag 1740 

attttataac cagagtctag cagtgataaa atgtacaatt tacgctctga atagaaacat 1800 

ggcaccaaag aaagggcatt gccatgacat ctcaaagcaa caggtaattg aagtggctga 1860 

gtggatacca cgttgtgagg tcagccactg ggtcacttgt agtcaatgca aagcttgcac 1920 

aaaccaacgt cagcctagtt tcactcagtt tgacctgaga ctttccaata caatctgttg 1980 

ttttgttttg tttttagaga cagggtcttg ctctgtcacc cagactggag tacaatgaca 2040 

caatcatagc tcactgcagc ctttaactac tgggctcaag acatcctcct gcttcagccc 2100 

ccagagagtt gggaccatag gtgcacacca ccacacctag ctaatttttg ggggaggtct 2160 

tgctatgttg cccaggctgg tcttgaactc ctgggctcat gcaatcctcc tgccttggcc 2220 

tcccaaagcg ctaggattag aggtgtgagc cgctgcaccc tgccccagta caatcttttt 2280 

tgaactcaaa tttttgctga catctgagtg cacacaccac agtgtaaatt atgccttatc 2340 

agaatctaaa tgaaaatagc gaacatttaa aagctatcac cattgtagta gaatcatcct 2400 

tcttttttga aatttgaagc atcccaggct taaaatcttg tgtttcagaa agacagttta 2460 

taccatgact gcttaattat ccccccaaag accttctgat tgaagtcatg tacagttcag 2520 

tggcctaaat tctctgcctt tttaacttgc tttgcaagcc tactctgaaa ataagttatt 2580 
caggattctt tacacagagc gatttgaaaa 1380 

cctgattcac tgctcacatt ttgagagcat 1440 

agtgccatac acagtatcgt ccgtgtctca 1500 

ggaggaagtt tggcttccta ataactcatc 1560 

ggcggaaagc caggggagga gatcactcat 1620 

tgtccacacc atcatcctgg atctctccat 1680 

cgtattaaga caggtaagta ctgaggaggc 1740 

atcccaaccc caccctgatc ctgattgaat 1800 

1823 



tagtcaagtt attctcaaag atgtcccagt 
cacacatact caaaaaaatg taactgacta 
actttctaaa aatcaaatct caggattgtt 
taactatact tgtcaaatag cadctatcta 
aaaattgtta atttatacac acacatacac 
gcaagtattt tttccgatta aatttatttt 



tgcctagaaa ggatcaaatg gaaqatttga 2640 
taaacacttt aacctaatca tctgtatcaa 2700 
ccactttaga gattctatgt aaagtttata 2760 
tgcatttaaa aatgcattaa tatctaacat 2820 
ttatacatag agagagattt tcggtatgtt 2880 
ccaaagaaaa aaaaaaaaaa a 2931 



<210> 46 

<211> 1492 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 6972455CB1 

<400> 46 

cttgctcctt gatcagtttg cggatggcct 
tggaattcat aaaaagctca gggttcccga 
atagctggct gcaagccacg ccccaaaggc 
ggtgcgggaa acctgaccac tgttcagacc 
gccttgcaga cgatccgaac ggtgctggca 
tccttcgcct ttttcggaga gctgccatga 
tgtttgccag cctgatgctg agccaggcac 
tcgcccgtgg caccctgaag gttgccgtcc 
gccctgacat gcaaccccgg ggcctggaca 
tcaaggtcaa gctggaactg acgccggtca 
ctggcaaagt cgacctggtt atttccagcc 
tcgaettctc caacgcctac gcgccgttct 
ccatcgccag cctcgacgac ctcaagggca 
aagacatcga gctgaccgcg gtagcgccca 
acaactccac catcgccgcc tacctggccg 
tggtcatggt ggccatcagc gaacgcaacc 
tcaaggattc gccggtgtac gtaggcgtga 
tcaaccagat cctggtggca gccaaggctg 
ggctcaaaga accgctgcct gccgacctct 
cgactttgtg ccggtgctgg ccaataccga 
tgagctgacg gccattggcg ccatcctcgg 
acgggcgtgg aagatccaac cgtttgcgtg 
caacacgccg ttcctggtgc aactgttctt 
gaaaattacc gagtggcaag ccgccgtgct 
caccgagatc atccgtgccg gcatccaggc 

<210> 47 

<211> 2406 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 8077665CB1 



gcaaggccgc ttccggtacg gcttgggcaa 60 
tagcgtgtgt gacccgttgt aaaactattc 120 
gctcgcggct ttttgcaaac gctcgaaaag 180 
agtaagacca ctgcacctca acaaacatca 240 
tgggctgtgc actgggcaaa ctcagaaaat 300 
ccaagcgcta cagcgccctg ctcactgccc 360 
ccgcccaggc gagtggtctg gatgacatcg 420 
cccaggactt cccgccgttt ggttcggtcg 480 
tcgacaccgc caagctgctg gccgaccaac 540 
acagcaccaa ccgcataccg ttcctcacca 600 
tgggcaagaa ccccgagcgc gcaaaggtca 660 
accttgccgt gtttgggccg cctgatgccg 720 
agaccatcag cgtgacccgt ggcgccatcg 780 
aggaagccac gatcaagcgt tttgaagaca 840 
gccaggtcga cctgatcgcc agtggcaacg 900 
ccaagcgcgt accggcactg aaggtgaaac 960 
acaagaacga gccggcgctg ctggaaaagg 1020 
acggcagcct tgggaaaaac gcgatgcagt 1080 
gacgcggagc tcaaagatgg cctatcaatt 1140 
cctgctcctg cgtggcgcgc tgttcaccct 1200 
cgtggccctg ggcactgtcg gcgccgtggt 1260 
ggtctttggt gtgtatgtcg agttgatccg 1320 
catcttcttc ggcctgccat ccctcgggtt 1380 
ggcgatggtg atcaacctgg gggcctactc 1440 
catcccacgg gggcaactigg ag 1492 



<400> 47 

atgtatgacc caccaaaagt gaataagaaa 
gtggaaatac agtcagccct tcatatctgt 
taccgccggt cacccgcgcg ccggcacacg 
cgcgtctgcg tggggtgcca aggggcccgg 
ggctccgcgg aggagccagt agagagaaag 
gccagcctga ccaaggatgc tgttaccttc 
atccctgctt gctggagaat gacccagatg 
gcccagcagg atgccctggc ccagccccag 
gagcctgtgc tgtttgggac ctgggatggt 
ggggttgtgc tcttcctgag gactggctgg 



aatgactctg ctgatcattc tgatgaatct 60 
ggatttgacc aaggcaggtg gatcctctcc 120 
ggcctggccc cagtacttgg ggtgcgggtc 180 
gtcccagcgt ggcccaaacg tgagggtagg 240 
gatggacaac aaggggagag gagaggagaa 300 
cagagcagtc tccctgtcca gatcaccagg 360 
tcccaggtgc aggagctctt ccatgaggca 420 
ccctggtgga agacccagct gttcatgtgg 480 
gtgttcacat cctgcatgat caacatcttt 540 
ctggtgggaa acacaggagt getcctgggc 600 
atgttcctgg tgtccttcgt catcctggtg 
gtcggggagc gcagcagcat cggcagcggt 
ggtgggcaga cgggaggcac catcgggctg 
ggctttgctg aatccatctc ggatttgctg 
atttcagttg cggtgcttct ggccttgctg 
atccgcctcc agctgctgtt gctgttcctg 
ggttctttca cccacctgga cccagaacat 
cagaacaaca cgctgcccga ttacagcccg 
ttcttcccag cggctacagg agtcatggcc 
cctgccgcca gcattcccct gggctccctg 
atcggctaca ggagtcatgg ccggcttcaa 
cagcattccc ctgggctccc tggcagctgt 
cagtctgagc tagagagtgg gagagaaaac 
tctgagagtg ccccgactgt gaccaatcta 
ggtgggctgg aaaattcacg tctggtgggt 
gggaatgctg ccgaggagga gtgtcgggac 
ttcctcggcc tcccgccccc gtccctcccc 
aaccaccgag tttccaggat cactggaggg 
cgaattgggg ctgcatctca gctctgccgc 
tgagttctct gagcctccat ttcctttctt 
ttcctagatt gcggcgggga gagaaggcac 
cttcccccag tgagcctccc ctccagctcc 
tcacactgtg ccacccacac aagcaggctg 
gaatattgtt tctctctgta ccacaggaag 
gaagaggcaa agagatcaga gaccagaaac 
tattctaacc acagtattta gaaaatatta 
tcttagctta ctgcagtgaa accattgcta 
gtgcaataac attgggaaga aaatgaagac 
aataaaagaa agaggcttct gaatgaggac 
ttccaggagg aacccttggc ccatcctctg 
gaacct 

<210> 48 

<211> 3686 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 55120485CB1 



gccctcgtca cggcgctgtc tgacattggc 660 
ggcgtctact ccatgatctc ctcggtcctg 720 
ctctatgtgt ttggacagat gtatatcacc 780 
ggcctcggga atatctgggc tgtgcgagga 840 
ggcattaacc tcgcaggtgt caaatggata 900 
ctggccgtgt ccacactgga ctttgtggtg 960 
ggtttcattg gatattcacc cgaactgcta 1020 
ggggaatctt ttttcactgt ctttggggtt 1080 
ggcttcaaca tggggggcga cctcagggag 1140 
gcagctgttg gcatctcgtg gtttctgtac 1200 
catggggggc gacctcaggg agcctgccgc 1260 
tggcatctcg taagttcccc tcatggcaat 1320 
cccaggcaga agagggattg ccccttttgt 1380 
tttcatattc cgatccttct tttaacactc 1440 
tcagacactg ccctttcatg cgccctgcct 1500 
ccagatcaca gatcgaatcc tttctttcct 1560 
cacccccatt ttgaacttgg ggattttttt 1620 
agcataaaga tgaagagttt tgcaatcaga 1680 
tgattggctg catgactctg gacaatttac 1740 
gtgaaatggg gataatgaca ctgtatgtgc 1800 
acagtggaag ctcagcaccc agctcttctt 1860 
cccaggccgt caggcctctg gggttttagt 1920 
ggtaaccttg ggtctatttc tgacctttta 1980 
aagcttctct gcccttcatt gggtaggtgt 2040 
aatgacttga gaagttaaaa gggcctgtgt 2100 
atgtatctgt ttcacaaaca ctgaatttgt 2160 
tcagcctggc gtacagcatc agggtatttt 2220 
aacagagata gagttgagtg ttgggaagga 2280 
cctgtcttgt ttctggcccc agctgggagg 2340 
attacccaga tgggaatgag gagaccacag 2400 

2406 



<400> 48 

atggaagaga acagtaagaa ggaccatcgg 
ctggaggtgt ttggttaccg ggaccacaat 
gtgctgacct gtgggggcct tctgctggtg 
gccaactgca tcccatgccc cttgcaagaa 
gaatttcaaa gatatatgag gaagaaggta 
cctgtaagca agaagtggga agaatccctg 
gccttaataa agccagaatt aaaactgcgg 
tggaacgacc tggagaagcg gtttcagaaa 
tctgacatcc atcagaeatt tggattgggt 
ttagtgtgtg ggcccaacgc cattgaggtt 
aaacaggttt taaatccatt ctatgtgttc 
caaggttaca tagaatactc tgtggccatc 
agtgtgtatg atttgcgaca gcaatcagtt 
aaagtccagg ttacaatcat tgtaaaagac 
ttggttcccg gagacattct tattcttcca 
ttgattgatg gaagctgcgt ggtgaatgaa 
acaaagacac cattgcccca gatggagaac 
gattatagga aacacgtcct tttctgtgga 
caggggcctg tacgagcagt cgttttgcaa 
gtgagatcca tcctgtaccc ccggcctctg 
ttcatcgtgt tcctggcctg ccttggtgtc 



gctttgctca accagggaga ggaggatgaa 60 
gtacggaaag ccttctgcct tgtcgcatcc 120 
ttctactgga gaccccagtg gagagtgtgg 180 
gcagacactg ttttgctgag gacaacagac 240 
ttctgcctct acttatacac actgaagttt 300 
gtggctgacc gccactctgt cataaaccaa 360 
tgcctggaag tgcagaaaat caggtatgtt 420 
gttgggttgc tagaagacag caattcctgc 480 
ctgaccagtg aagagcaaga ggtcagaaga 540 
gaaatccaac ccatatggaa gctgcttgtt 600 
caagccttca ccctaactct gtggctgtct 660 
atcattttga ctgttatctc cattgtctta 720 
aagctgcata acctcgtgga ggaccacaac 780 
aaaggtttgg aggagctgga atcccgtctc 840 
ggaaaatttt cattgccatg tgatgctgtt 900 
ggcatgctta caggagaaag tatacctgtt 960 
actatgcctt ggaaatgtca cagtttggag 1020 
acagaagtta tccaggtcaa gccctctggg 1080 
acaggttaca atacagccaa aggggactta 1140 
aacttcaaac tatacagcga tgccttcaag 1200 
atgggttttt tctatgccct aggggtatat 1260 
atgtaccatg gagttcctcc aaaagatact 
actgtccctc cagtgctgcc agctgccctg 
ctgaagaaaa agaaaatctt ctgtatctcc 
aacctcgtgt gctttgacaa aactggcact 
actgtcccta ctgctgacaa ctgcttccag 
gtgccatgga gcccactgtg tgcggccatg 
gggaccatcc agggagaccc tctggacctc 
gaagattgca ttgtagactc ctgcaaattt 
ggaccaaaag ccagtaagag tccagtggaa 
tcctcgagcc tgcagaggat gtccgtgatc 
gtctacatga aaggtgcccc agaaatggtg 
aagaatttcc cacaggaact gaggagttac 
gcccacaaaa ccttaaagat ggggaatctt 
gtggagtcag agttaacatt tctgggactt 
accaaactgg tcttgaagga actgagtgag 
gataaccttc aaacggccat tactgttgca 
caagtgatca ttgttgaggc cgatgaacca 
cagctggtgg agaaccaaga gactggacct 
aacagttcaa cccctcgtgg ggaaggagga 
tcataccaag tgatatttca gcatttcaac 
acagtttttg caagaatgtc tcctgggcag 
ttaaatgcct gcacagtaca aaatgaaagc 
ccagaaaaaa tggaaagtaa tagcaccttc 
ttgggaacaa tcaactgtat cactgtggct 
cagccaactt atacaaacta tatatttgtc 
ctattcattc tatttgctga tataccagaa 
cccgtcctgt ggagggcctc cattgtcatc 
gtggccgagg aggctgttat tgaaaatcga 
ggctatcagt caaaaagcca gtatcggata 
tggcccccgc taaaccaaac ctcccactct 
tacagcaatc cagtatttga gagcaatgaa 
gttgatatta cagagaaaca atgacaaaag 
actcttgtaa catcagctgg agttttgggg 
gtctgtttga taaaatactg tctacagaga 
ttggcttctg aaggtattag aggcattcag 
gggccaagag acaagcaggc taaaggggaa 
caatactttc ctttttataa aaatgagttg 
gttgtaattt ttgaattttt acttcatcga 
ccaatcttta ctgtctgggt tccttttctc 
ttctgtttct ggccttaccc aacatattgt 
aaaacaactc tcaaaaaaaa aaaaaa 



gtgaccatgg ccctgatcct cctcaccgtg 1320 
accataggca acgtgtatgc tcagaagaga 1380 
ccacagagaa tcaacatgtg tgggcaaata 1440 
ctcactgaag atgggctgga cctctggggg 1500 
gaagcccaca gctttgcctc aggccaggct 1560 
gccagctgcc actctctgat ccttctcaat 1620 
aaaatgtttg agggcactgc ctggaaaatg 1680 
gggacgtcag tttcaaacat cataaaacca 1740 
gccatcatca ccttgtgcca gtttccattt 1800 
gctcagctag ctggggagaa tcatttccat 1860 
gccaggttct gcagatctga aacagtgccc 1920 
acggtgcaag gcttccgtgt cattgctctt 1980 
tcagaagtcg agcacttagc cagagaaaaa 2040 
ctcatcatgg agaatcgctt gaaaaaagaa 2100 
gcccgtatca ggactgtgat gattacaggt 2160 
aagaattctg aaatgatccc tccaggcagc 2220 
gaagaatttg ttcctgcctc tgtgacctgg 2280 
gggaagaaag aaatctacat gcatactgga 2340 
agctgttacc attttgcaat gagtgggaaa 2400 
agcttgcttc caaaaattct ggtgaatgga 2460 
aaatcaagcc ttattgaaga atttcagaaa 2520 
atctcagagt taaccatgtc tccaactgct 2580 
acaagttttg agaacactac agtctggttc 2640 
cttgtgttct ctaaaggaaa accatttaga 2700 
cttgtgctga taatacagct tggtgtatgt 2760 
ttatatagac gtttggatct gctctgcact 2820 
atgctcagct tgaatttcat tgtgtccctt 2880 
gccctgtgga tgatgattaa aagatgtttc 2940 
tggcagaggg acttggcaaa tgaccctagt 3000 
gacatgccgg agtgtggcag aggagtgtct 3060 
gaacaactct gaaggacatg tgatatccaa 3120 
ggaccagttt cattgatttt aagaaatgag 3180 
ctacctatca aatcatggtg acaaactaaa 3240 
agaacattga cccagaagag tgctttcctc 3300 
tgaaggcaag aatgggccaa gaatttctat 3360 
aaatatgtga tctttcctct taggacagga 3420 
aggggaagta gtactgtgta tactcacgga 3480 
tggaaaaact aatgcgtggg aaacattaca 3540 
tgtagtactt ttcagatttg aggctttttc 3600 
taaggataag cttagcctat gctgcaataa 3660 

3686 



<210> 49 

<211> 2807 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3112883CB1 



<400> 49 

aagaattcgg cacgaggcca cagccgcggg 
gtgaacagac actcccttct gtagccatgg 
ggatttaccc cactgtgatc ctctgcttat 
aaccattcct tatcccatat ttatctggac 
caaatgagat cttccccgtt tggacatact 
tcctcaccga ttatgtccgc tacaagccag 
ttacctggct gctgctgttg tttggccaag 
tctatgggat ggtcaccgcc gccgaggtgg 
gccccgagca ctaccagaga gtgagcggct 
cagcagggtc ggtgctggct caactcttgg 
tcaacgtcat atccttggcc tctgtctccg 



tgcgtggtcg tgcggcaagt gagcgatttg 60 
attgttacag aacttcacta agcagttcct 120 
ttggtttttt ctccatgatg agaccctcag 180 
cagataaaaa cctgaccagt gcagagataa 240 
cctacctggt gctgctgctg cctgtgtttg 300 
tcatcatctt gcaaggtatc agtttcatca 360 
gagtgaagac catgcaggtt gtagagttct 420 
cctactacgc ctacatatac agcgtggtca 480 
actgcaggag cgtcacgctg gccgcctaca 540 
tatccctggc gaacatgtcg tacttttacc 600 
tggctttcct tttctcactt ttcctaccaa 660 
tgcccaagaa aagcatgttt tttcatgcaa 
gcgtgaatcc agtattagag gaaactcacg 
aacccacatc agaaatactc agcacttcag 
tgaaaccaag caatgtgact gtggacgttt 
gctactcctc aaaacgtctt ttctactggt 
ttaaccaggt tttgaactat gttcaaatcc 
cttccatcta taatggggcc gtagaagcta 
ttgcagtggg ttatgtgaaa gtcaactggg 
tctcagttgt caatgccggt tctttatttc 
gctatgctgg ctatttgata ttcaagtcca 
ttcagattgc agttaatctg aatgtggaac 
ttattgcctt ggtgattcag accatcatga 
acttgccagt cagcattcag tttttagttt 
ttttcctaat gagaagcatg tatattacct 
gccctgctcc aagtgagaat ccagatgtgt 
tgtcaacaaa actctaacct catcgcaaca 
taataaggaa agattttgag atgggtggca 
aaatctggat tccaatggac ctttcaaaac 
tctatgtgac caattttact ggatgcaatt 
catattgggg accccctgtg actagtagca 
aagacatgca gatggcgtgg aaccaaacca 
attcaccatt tgtgaggaga aatactaact 
tatttgatct taccaaagaa gtcaacacat 
ttcctcttct attaagtgta tttctgttta 
ttgtcccatt tctaagggct tctcttcaac 
tgatagtatt attatttaat ttttttatga 
tgttgaaagt ctccctcaag aatagcttca 
ggtcatttcc cagtggcctg ggagaggcag 
gttaatgcat ctatgggcca ggtgcagtgg 
gccaaggtgg gagttttgct tgaggctggt 
gcctcggcct cccaaagagc tgagattaca 
ctgattttta aatggtagca attgtgctaa 
gttaacattt tttgatgaat tctgctgaaa 
tcaagaaact ttcaataagg gatgcattga 
gtcagatttt attaggctgg acttgtggtt 
gtgagaggta attgaggttg gggtttaaac 

<210> 50 

<211> 2170 

<212> DNA 

<213> Homo sapiens 



aacccagcag agaaataaag aagtcatcaa 720 
aaggtgaagc accaggctgt gaagagcaga 780 
ggaagctgaa taagggccag ctgaacagcc 840 
ttgtgcagtg gttccaagat ttgaaggagt 900 
ctctatggtg ggctttcgcc acagcaggtt 960 
tgtgggacta caaggcgcca tcccaagatt 1020 
ttgcaacctt tggaggggct gtggctgcct 1080 
accttctggg agagctggct ctggtggcct 1140 
tcatgcatta cacagccaat atctgggcgt 1200 
gctatatgct tcttataacc atagcagtat 1260 
gctatgcctt ggtatttgga atcaacacct 1320 
ctgtgattgt agtagatcag agagggctca 1380 
atgggagcta ttttgcagta attgctggaa 1440 
actcaaccaa atcccagaag gatgtacaga 1500 
ctcacccaga ggaagagagt aatatcatca 1560 
aacgcaacag tggctttcaa agttatgcaa 1620 
tatgttttgc cataacttga catgctttgc 1680 
cacaacaaaa cctcagtttt agatgagttc 1740 
gacaggaccc gtcatcataa ttaaacaacc 1800 
gctggaaaat tctggttttt atcacttgta 1860 
tgagaaaact ccagccatcc tggagttgat 1920 
ggactgaccc tattgctagg cttaaatact 1980 
gggacctttg tgtcacatga accattttct 2040 
agttacagtt ctctaagaga attacaatgt 2100 
tctaataaca gcattattca cgttatgatt 2160 
ttattttcca ttttgtgctc tgagttttgc 2220 
gatcctcttg tgtatttgca gaatacacaa 2280 
tgagccttct ctccaccacc atagacaggt 2340 
ctcactcctg taatccgagc aatttgggag 2400 
ctcaaactcc tgacttcagg tgatctgtct 2460 
ggcgtgagcc accatgcctg gccaaaacac 2520 
ataacatgtg tcaacaattt tcaaagagtt 2580 
attatctgaa catacctgtt tttcgaaact 2640 
gccgtctgac agtgagaata acgatgctga 2700 
acaactttat cccgcatttt gggaagctaa 2760 
ccttggcatt ttaacct 2807 



<220> 

<221> misc_feature 

<223> Incyte ID No: 4253888CB1 

<400> 50 

gtgaatgccc cttccatccc ccgcccctgg 
Gcttctcaca ccggccgctg ccgccgccgc 
acttctgaac atcttttatt agtggaaata 
ttaaaccagt gtttgtgcgg ttctgattca 
aggagtacag ggtcttttgt gatgacaata 
ataaaacaag tggtggtcca agtagtggag 
acacaattca atccgacaag tttttaaatt 
agaaaaacca agaagacagg cacaaaaaac 
ctgcagtaca gacttttagt aatagatctg 
agcagtcttt tcatgccaat aatgggcacg 
actatgccaa ttataattac tgtgatggaa 
aagatgaaga tatatctagt gatggtgatg 
taccaaagga atccagtggc atcatggcat 
gttttggaac agtttcagct ggcatggtac 
gaaaagttac agaagttttc attttagtcc 
aaatgacatt ggcatccaga ttatccactg 



tgtctcggtc tggtctgagg agacggggac 60 
gctccggaac agatccagtc cttctgtgga 120 
ttttctacac aatgaagtca acaacttaat 180 
tctgctgtgg ttcccgaagc ttgagatcta 240 
tgactaatag taaaggaaga tctattaccg 300 
gaggttttgt agattggact ttacgtttaa 360 
tactcttgag tatggttcca gtgatttacc 420 
caaacggcat ttggcaagat ggattatcaa 480 
agcaacacat ggagtatcac agtttctcag 540 
catcatcaag ctgcagccaa aagtatgatg 600 
gggagacttc agaaaccact gccatgttac 660 
aagatgctat tgtagaagtg accccaaaat 720 
tgcaaatact tgtgcccttt ttgctagctg 780 
tggatatagt acagcactgg gaggtgttca 840 
ctgcacttct tggtctcaaa gggaacttgg 900 
cagtaaatat tgggaagatg gattcaccca 960 
ttgaaaagtg gaacctaata 
tgggttttct agcagctgtg 
accttgatca ttccatactt 
ttctgcaggg aataataatg 
ctgataatgt tgctacaccc 
tggcttggat aagtcagggc 
tagttggtgt atttttcttg 
cagccacaag aacagttctc 
gtagcattgg gggccttatt 
ttgtttacac gccagttatt 
ggatttctac ctacctccat 
gttgttacta cccatttaga 
ttctactgct tttagtgatt 
aaagtggtca tacttcttta 
tacaggtatt taccttgctg 
gaaaggaccc ggatagtttc 
cagctctgtt agccttaagt 
ttggagacta ataaattcta 
accacttatg gctctttttc 
cttcagttgg ccctgattca 
aaaaaaaaaa 

<210> 51 
<211> 1722 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 7479974CB1 

<400> 51 

atggatgctg ttaaatactt aaataaattg aatttagata acattgagtt aacaaaatat 60 
ttgtttttta ctggtaaagg tggcgtaggc aaaacaacga tatcaagttt tattgcttta 120 
aacttagcag agaatggaaa gaaagtagct ttagtaagta ctgatccagc tagtaattta 180 
caagatgtat ttcaaatgga attatctaat aaattaacta aatatcaacc tatacctaat 240 
ctctctatag ccaatttcga cccgattgtt gctgcagacg attataaagc acaatctata 300 
gaaccttatg agggtattct accagaagat gtgcttgctg aaatgaaaga acagttaagt 360 
ggttcatgta cagttgaagt agcagcattt aatgaattta caaatttttt atccgataaa 420 
actttagaac aagaatttga tttcattata tttgatacag ctcccacagg tcacactttg 480 
agaatgcttg aattaccttc tgcatggaca gattatttaa atacaacgag taatgacgct 540 
tcttgcttag gtcaattatc aggtttaaat gaaaatagag ttaaatataa ttcagcactt 600 
gaaaaactac gtaaccaaga tgatacgacc atgatgttag ttgcgagacc tactcactct 660 
tctatatatg aaattcaaag agcgcaacaa gaattacaac aactgtcaat ttctaaattc 720 
aaagtaatca ttaacaacta tatagaagaa agtcacggtt taatttcgag tcagatgaaa 780 
tcggaacaag ataaaaacat taatcatttt actgaatggt taaataacaa tcatgcttat 840 
tacgttccat ataaaaagca gaaagaagaa ggtatagaaa atttaactaa tctattaaat 900 
gatgataact taattgaaaa tgatgacttt attgttgaag atcatccgca attcaataaa 960 
ttaatcgatg aaattgaaaa tagtaaagtt caatatttat ttacaatggg aaaaggtggc 1020 
gttggtaaaa cgacagtagc aacgcaatta gctacagcat tatctaataa aggatatcgt 1080 
gttcttttag caactactga ccctactaaa gaaattaatg ttgaaactac aagtaattta 1140 
aatactgctt atattgatga agaacaagca ttagaaaagt ataaaaaaga agtactagcc 1200 
acagtgaatg atgatacacc acaagacgat attgattata ttatggaaga tttaaaatca 1260 
ccttgtacag aagaaatagc atttttcaaa gcctttagtg acattatgga gaatcaagac 1320 
gacatggatt acgtcattgt agacacagct cctacaggcc ataccttgct gttacttgat 1380 
tctagtgaaa atcatcatag agaattaaag aaaaaatcaa ctcaaactac cagtaatgtt 1440 
gaaacattat tacctaaaat tcaaaataaa aatttaacac agatgataat cgtaacacta 1500 
gcagaaaaaa caccttattt agaatctaaa cgtttagtag aagatttaaa tagagctaat 1560 
ataggccata attggtgggt tgttaatcaa tcgttagtta cgctaaatca acgtgatgac 1620 
ctttttagta acaaaaaaga agatgaatca ttttggataa acaagattaa aaatgaaagt 1680 
tttgacaatt attttgtcat accttatggg gggttatcat aa 1722 

<210> 52 



attggcaact tggctttaaa 
gcagcaatta tattgggctg 
ctgtgctcta gcagtgtggc 
gttggggtta tcgttggttc 
attgctgcta gttttggcga 
ttatactcct gtcttgagac 
gctctaaccc ctatttggat 
cactcaggct gggagcctgt 
ctggacacaa ctgtatcaga 
aatggtattg gtggtaattt 
ttacatagca ttccaggaga 
actttctttg gtccaggagt 
cctggacatt taattttcct 
actataatct tcatagtagt 
tggattgctg actggatggt 
tccatcccct acctaacagc 
tttcattttc tttggcttat 
caaactgctc tcaagttacc 
aaaactctta aatcagtagt 
attaaatggc cttaattttt 



gcaggttcag gcaacagtag 1020 
gattccagaa ggaaaatatt 1080 
aactgccttc attgcatctc 1140 
aaagaagact ggtataaatc 1200 
ccttataact cttgccatat 1260 
ctattactac atttctccat 1320 
tataatagct gccaaacatc 1380 
cataacagct atggttataa 1440 
cccaaacttg gttgggattg 1500 
gfirtggccatt caggctagca 1560 
attgcctgat gaacccaaag 1620 
aaataataag tctgctcaag 1680 
ctacactatt catttgatga 1740 
gtatttattt ggcgctgtgt 1800 
ccatcacttc tggaggaaag 1860 
attgggtgat ctgctcggga 1920 
tggagatcga gatggagatg 1980 
aaggaagaaa atacacgaca 2040 
ttgacttttg ccagggtaat 2100 
ttttaaggaa tttgtgtcaa 2160 

2170 
<211> 1424 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7483850CB1 

<400> 52 

Gtgatgtcta tttaggaaga aacaaactgg tgtatatttg cttgtttcta cattttctag 60 
gcactgcttt gttatctgtg gtggcttttc ccttggaaga tttctatctg ggcacttacc 120 
atgcagttaa caacatacca aaaacagagc agcacaggct gttttatgta gcactgctga 180 
ccatttgcct tggcattgga ggcgtaagag ccatcgtctg tccactgggt gcttttggcc 240 
ttcaggagta tggatcacaa aaaacgatgt ctttttttaa ctgattcctt caggatatta 300 
tctgcaaact atgaattcca acctgaattt ggatggattt cttctgccga ttgcagtaat 360 
gaatgccatc agcagcctac eattactaat tctggctcct tttctggagt atttcagcac 420 
ctgcctgttt ccctctaaga gagttggatc atttctgtca acatgcatca ttgctggaaa 480 
tctttttgct gcattgtctg tgatgatagc tggcttcttt gaaatacacc gaaaacattt 540 
ccctgcagtg gagcagcccc tttcaggaaa agttctcact gtttcctcca tgccctgttt 600 
ctacctgatt cttcagtatg ttttacttgg agtggcggaa acactggtaa acccagccct 660 
ctctgtaata tcatacagat ttgttccaag caatgtcaga ggaacctcca tgaattttct 720 
gacactgttc aatggatttg gctgtttcac aggggcactg ctggtgaagt tggtatatct 780 
catctcagaa ggtaaaaata ggcaatggtt tccaaacaca ttaaacaaag gcaatttaga 840 
aggcttcttc ttcttcctgg catcattaac attgttgaac gtcctgggat tctgcagtgt 900 
ttcacaaaga tattgtaatc taaatcattt taatgcccag aacatccgtg gaagtaatct 960 
tgaagaaaca cttctcctcc acgaaaaatc tctgaaattt tatggcagta tacaggaatt 1020 
ttcttcaagt attgatcttt gggagacagc cctatgaaac tgtgtttgag tctacctgtc 1080 
ttatgagaac agtattcatt gttttcttct tcagttgagc attttatgtg ttttagtgta 1140 
aaagccaata tatacagagg tgattttaca aatatcatct gtatacttta tttacaaaga 1200 
ctaattatga attttctagt aaactgtcat gcttttgcat taacatgtta tagcattaga 1260 
atttaataaa ataatttcaa agtctcagtg attactctat gacaaaaaaa aaaggggggc 1320 
gccgattagt gagcttcgtc gaccgggaaa taattccgga cggacctgca ggcgtaggcg 1380 
atcctgagat tccatcaatg ggggcgtcga gcatgtttaa aggc 1424 

<210> 53 

<211> 3598 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5508353CB1 

<400> 53 

caatttgcaa cacagtagtg gtttctgctc ctaaccaacc ccgacaaaag atcagacacc 60 
cttcactggg ggggttgccc attaagtctt tggaagagat taaaagtctt ttccagagat 120 
ggtctgtccg aagatcaagt tctccatcgc ttaacagtgg gaaagagcca tcttctggag 180 
ttccaaacgc ctttgtgagc agactccctc tctttagtcg aatgaaacca gcttcacctg 240 
tggaggaaga ggtctcccag gtgtgtgaga gcccccagtg ctccagtagc tcagcttgct 300 
gcacagaaac agagaaacaa cacggtgatg caggcctcct gaatggcaag gcagagtccc 360 
tccGtggaca gccattggcc tgcaacctgt gttatgaggc cgagagccca gacgaagcgg 420 
ccttagtgta tgccgccagg gcttaccaat gcactttacg gtctcggaca ccagagcagg 480 
tcatggtgga ctttgctgct ttgggaccat taacatttca actcctacac atcctgccct 540 
ttgactcagt aagaaaaaga atgtctgttg tggtccgaca ccctctttcc aatcaagttg 600 
tggtgtatac gaaaggcgct gattctgtga ccatggagtt actgtcggtg gcttccccag 660 
atggagcaag tctggagaaa caacagatga tagtaaggga gaaaacccag aagcacttgg 720 
atgactatgc caaacaaggc cttcgtactt tatgtatagc aaagaaggtc atgagtgaca 780 
ctgaatatgc agagtggctg aggaatcatt ttttagctga aaccagcatt gacaacaggg 840 
aagaattact acttgaatct gccatgaggt tggagaacaa acttacatta cttggtgcta 900 
ctggcattga agaccgtctg caggagggag tccctgaatc tatagaagct cttcacaaag 960 
cgggcatcaa gatctggatg ctgacagggg acaagcagga gacagctgtc aacatagctt 1020 
atgcatgcaa actactggag ccagatgaca agctttttat cctcaatacc caaagtaaag 1080 
atgcctgtgg gatgctgatg agcacaattt tgaaagaact tcagaagaaa actcaagccc 1140 
tgccagagca agtgtcatta agtgaagatt tacttcagcc tcctgtcccc cgggactcag 1200 
ggttacgagc tggactcatt atcactggga agaccctgga gtttgccctg caagaaagtc 1260 
tgcaaaagca gttcctggaa ctgacatctt ggggtcaagc tgtggtctgc tgccgagcca 1320 
caccgctgca gaaaagtgaa gtggtgaaat tggtccgcag ccatctccag gtgatgaccc 1380 
ttgctattgg tgatggtgcc aatgatgtta gcatgataca agtggcagac attgggatag 1440 
gggtctcagg tcaagaaggc atgcaggctg tgatggccag tgactttgcc gtttctcagt 1500 
tcaaacatct cagcaagctc cttcttgtcc atggacactg gtgttataca cggctttcca 1560 
acatgattct ctattttttc tataagaatg tggcctatgt gaacctcctt ttctggtacc 1620 
agttcttttg tggattttca ggaacatcca tgactgatta ctgggttttg atcttcttca 1680 
acctcctctt cacatctgcc cctcctgtca tttatggtgt tttggagaaa gatgtgtctg 1740 
cagagaccct catgcaactg cctgaacttt acaaaagtgg tcagaaatca gaggcatact 1800 
taccccatac cttctggatc accttattgg atgcttttta tcaaagcctg gtctgcttct 1860 
ttgtgcctta ttttacctac cagggctcag atactgacat ctttgcattt ggaaaccccc 1920 
tgaacaeagc cgctctgttc atcgttctcc tccatctggt cattgaaagc aagagtttga 1980 
cttggattca cttgctggtc atcattggta gcatcttgtc ttatttttta tttgccatag 2040 
tttttggagc catgtgtgta acttgcaacc caccatccaa cccttactgg attatgcagg 2100 
agcacatgct ggatccagta ttctacttag tttgtatcct cacgacgtcc attgctcttc 2160 
tgcccaggtt tgtatacaga gttcttcagg gatccctgtt tccatctcca attctgagag 2220 
ctaagcactt tgacagacta actccagagg agaggactaa agctctcaag aagtggagag 2280 
gggctggaaa gatgaatcaa gtgacatcaa agtatgctaa ccaatcagct ggcaagtcag 2340 
gaagaagacc catgcctggc ccCtctgctg tatttgcaat gaagtcagca agttcctgtg 2400 
ctattgagca aggaaactta tctqtgtgtg aaactgcttt agatcaaggc tactctgaaa 2460 
ctaaggcctt tgagatggct ggaccctcca aaggtaaaga aagctagata ccctccttgg 2520 
agttgcaagt attctttcaa ggttggaaga gggattttga agaggtatct ctccaagcaa 2580 
gaatgacttg tttttccata agggacatga gcattttact aggcttggaa gagctgacat 2640 
gatgagcatt attgtatgtt tgtatataca tttgtgatag agggctagag tttgacctag 2700 
agagagttta aggaagtgaa atatttaatt cagaaccaaa tgcttttgta aaactttttg 2760 
gattttgtaa aagcattttc attctcttag aaattcaagt attttcaagg ggagtcattt 2820 
gagatatatt tattttacta ggagatctta tattctaggg aaatgcttta aatggtcagg 2880 
ctccagtcgg aattttttta agaaaaaagt agtttttaat acattggtta ggactcagag 2940 
gaaatacgga aaaaacattg tagatggtta atttacagat aaaatcccaa gagcctttta 3000 
aacaacaagg tacctaaaat agggtataat tatactgctt aaaatacagt tagtgcctat 3060 
taatagcttt ttatttccta tggggagatg cttttggtct tctggctgaa gatgtaggca 3120 
tacctctcac tcatttcaat gttttcctga ggtggagcct tcattggaaa ggggaaagag 3180 
ggattctagg gtttcatcag ggaccaggaa tgcattcctc tgtcaggttc caatcaagag 3240 
aagacctttt atgagatctg cctctgtata gatgtcgtca gttaggaaac tgaagccata 3300 
ggtcaggaga catcagctca gcctgtggcc atgggtgatt tcctgtattt taaaactgac 3360 
agtagcctga tccaacgtga tatcaatcat tttcaacacc atccttcaga ggacaatgtg 3420 
aggtcattaa gttttacaat tacctggaga ttttcaggtg gtggggcctc taaaattgga 3480 
ttaacacgga actatacagt ggtccggagg tcaaggggtc tcgaaaattt ggtacatctc 3540 
tcggcgggag tcacacagtc cctcggactt gtgcgggtgg cacacattta caggggca 3598 

<210> 54 
<211> 1485 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 8543628CB1 

<400> 54 

gctattggat tctccctata tgagtcgtat taatttctac tacgaaatgc tcgccgtgct 60 
gacgctgatt tgcgtgctgt tgttcttgcc gttgtcgctg ttgctcagcc gcctggaaag 120 
gaggctccag catggccagt tcgggtcttg agttgctctg ggtgtcgttg ccgcaactgg 180 
gcaagggcgc tgcgcaaacc ctgtcaattt cgtttttgag catcggcttc agtacggttg 240 
gcggcgtgct ctacggcgtg ctgcgcactt tgaataacaa ggcgatcaac ggcgtgttgc 300 
gggtctacct ggaactgttc cgcgccattc cggtgctggt gtggttgtac ctgctgtttt 360 
ttggcgtgcc gatcttcttc ggcctgagca tcccgagttt ctggtgcgcg gtgctggtgc 420 
tgtcgttgtg gggcgccagc gaggtgggcg aagtggtgcg tggcgcgctg cattcgctgc 480 
cccgtggcca acgcgaagca ggcttgtcga ttggcctgtc cgacctacag ttgtacggct 540 
acgtactgct gccccaggcc ctcaggcgca tgacgccgcc gaccatcaac gtgtacacgc 600 
gcatcatcaa gaccagctcc ctggcggtgc tgatcggcgt ggtggatgtg atcaaggtcg 660 
gccagcaaat catcgagcgc acctacaagt cggtattgat ctacggtgcg ctgttcctgt 720 
ttttcttctt tatctgctac ccgttgtcgg ccgcctccaa gctgctggaa cggcgctggg 780 
cccaagcatg agcgcattga tcgagttcca gggtttcaac aaattcttcg gcgaggcgca 840 
ggtgctcaag ggcatcgacc tgagcgtgca gcgcggcgaa gtggtggtga tcttaggccc 900 
cagcggctgt ggcaaaagca ccttgttgcg ctgcctcaat gggctggaag tcgcgcacag 960 
cggcagcctg cgctttgccg gcaaagagct gctggataaa accaccgact ggcgccaggt 1020 
acgccaggac attggcatgg tattccagag ctaccacctg tcccgcacat gagcgtgcct 1080 
cgaaacatcc tgctcggccg ctgaaagtgg aaaagcgcga ccgcgtggtt gcctgtgagc 1140 
aggcggaaaa atgctgggag cgccgtggcc ttgccgaaag cgcgacgcct ttcccacgcc 1200 
agctttccgg tggccagcaa caacgcatcg catcgtccgt cgttgtgcat gaaccctcag 1260 
tcatgctgtt gacgaagttc accggggcct cgccgaaatig gcaggaatgt tggagtgtcc 1320 
agaattgccg gaggggtacc ctgtgtttcc cagaatggtt gccggcggcg cgacagggtg 1380 
atgaggggcg tctcaaaaac ccggttttta ccgcacgcgc gactggagct tttcccccaa 1440 
gaggtaatta cgctgcctcg gggagaacaa cggagacaca cgcgc 1485 

<210> 55 

<211> 1470 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7482754CB1 

<400> 55 

gcctgttctg cagaggagga ggctgggctc agggcacttt tcacgccccc tccgttgggc 60 
tggggacctt ggtaagccat ggagggacag accccaggaa gcaggggcct tccagaaaag 120 
cctcaccctg ccacggctgc tgccactctg tcctcgatgg gcgctgtctt catcctcatg 180 
aagtccgcgc tgggagctgg cctgctcaac ttcccctggg ccttctccaa agcgggcgga 240 
gtggtccctg ccttcctggt ggagctggtc tcgttggtct tcctgatcag cgggctggtc 300 
atcctgggct atgctgctgc tgtcagtggc caggccacct accagggtgt ggtcaggggg 360 
ctgtgtggcc ctgccattgg gaagctgtgt gaggcctgct tcctcctcaa cctgctcatg 420 
atctccgtgg ccttcctcag ggtgatcggg gaccagctgg agaagctgtg tgactccctc 480 
ctgtctggca ccccgcccgc cccgcagccg tggtacgcag accagcgctt caccctgccc 540 
ctgctctccg tgctggtcat cctgcccctg tctgccccgc gggagatcgc cttccagaaa 600 
tacacaagcc cctcacacgg tcactgtgtt agcatcctag gcactctggc tgcctgttac 660 
ctggccctgg tcatcaccgt gcagtactac ctctggcccc agggcctcgt gcgtgagtcc 720 
catccttcac tgagccctgc ctcctggacc tctgtgttca gtgtcttccc caccatctgc 780 
ttcgggtttc agtgtcacga agctgccgtc tccatctact gcagcatgcg caaacggagc 840 
ctctcdcact gggccctggt gtctgtgctg tccttgctgg cctgctgcct catctattca 900 
ctgacggggg tttatggctt cctgactttt gggacagaag tttctgctga cgtcttgatg 960 
tcctacccag gcaatgatat ggtcatcatt gtggcccggg tcctttttgc tgtctccatc 1020 
gtaactgtct accccatcgt gctcttcctg gggaggtcag tgatgcagga cttctggagg 1080 
aggagctgct tggggggatg ggggcccagc gccctggccg acccctcagg gctgtgggtc 1140 
cggatgccgc tgaccatcct gtgggtcacc gtgacgctcg ccatggcgct gtttatgcct 1200 
gacctcagcg agatcgtcag catcatcgga ggcatcagtt ccttcttcat cttcatcttc 1260 
ccaggccctg gaggcaaaca ccgtttctgc gaaggatggt ttcttaggaa accttcccaa 1320 
acagcccaag cattatctga gaaggggaag ccagcaaggt ttgtgcctca tctgtgcaat 1380 
gggtgtcgag cctataggac caagagtcaa gtgagaaacc tgcgagccac aggaagagag 1440 
tctctgtcta cccagtatga gggcatctga 1470 

<210> 56 

<211> 3132 

<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 3794818CB1 

<400> 56 

atggagtttg tgcgggcgct gtggctgggc ctggcgctgg cgctggggcc ggggtccgcg 60 
gggggccacc ctcagccgtg cggcgtcctg gcgcgcctcg ggggctccgt gcgcctgggc 120 
gccctcctgc cccgcgcgcc tctcgcccgc gcccgcgccc gcgccgccct ggcccgggcc 180
gccctggcgc cgcggctgcc gcacaacctg agcttggagc tggtggtcgc cgcgcccccc 240 
gcccgcgacc ccgcctcgct gacccgcggc ctgtgccagg cgctggtgcc tccgggcgtg 300 
gcggccctgc tcgcctttcc cgaggctcgg cccgagctgc tgcagctgca cttcctggcg 360 
gcggccaccg agacccccgt gctcagcctg ctgcggcggg aggcgcgcgc gcccctcgga 420 
gccccgaacc cattccacct gcagctgcac tgggccagcc ccctggagac gctgctggat 480 
gtgctggtgg cggtgctgca ggcgcacgcc tgggaagacg tcggcctggc cctgtgccgc 540 
actcaggacc ccggcggcct ggtggccctc tggacaagcc gggctggccg gcccccacag 600 
ctggtcctgg acctaagccg gcgggacacg ggagatgcag gactgcgggc acgcctggcc 660 
ccgatggcgg cgccagtggg gggtgaagca ccggtacccg cggcggtcct cctcggctgt 720 
gacatcgccc gtgcccgtcg ggtgctggag gccgtacctc ccggccccca ctggctgttg 780 
gggacaccac tgccgcccaa ggccctgccc accgcggggc tgccaccagg gctgctggcg 840 
ctgggcgagg tggcacgacc cccgctggag gccgccatcc atgacattgt gcaactggtg 900 
gcccgggcgc tgggcagtgc ggcccaggtg cagccgaagc gagccctcct ccccgccccg 960 
gtcaactgcg gggacctgca gccggccggg cccgagtccc cggggcgctt cttggcacgg 1020 
ttcctggcca acacgtcctt ccagggccgc acgggccccg tgtgggtgac aggcagctcc 1080 
caggtacaca tgtctcggca ctttaaggtg tggagccttc gccgggaccc acggggcgcc 1140 
ccggcctggg ccacggtggg cagctggcgg gacggccagc tggacttgga accgggaggt 1200 
gcctctgcac ggcccccgcc cccacagggt gcccaggtct ggcccaagct gcgtgtggta 1260 
acgctgttgg aacacccatt tgtgtttgcc cgtgatccag acgaagacgg gcagtgccca 1320 
gcggggcagc tgtgcctgga ccctggcacc aacgactcgg ccaccctgga cgcactgttc 1380 
gccgcgctgg ccaacggctc agcgccccgt gccctgcgca agtgctgcta cggctactgc 1440 
attgacctgc tggagcggct ggcggaggac acgcccttcg acttcgagct gtacctcgtg 1500 
ggtgacggca agtacggcgc cctgcgggac ggccgctgga ccggcctggt cggggacctg 1560 
ctggccggcc gggcccacat ggcggtcacc agcttcagta tcaactccgc ccgctcacag 1620 
gtggtggact tcaccagccc cttcttctcc accagcctgg gcatcatggt gcgggcacgg 1680 
gacacggcct cacccatcgg tgcctttatg tggcccctgc actggtccac gtggctgggc 1740 
gtctttgcgg ccctgcacct caccgcgctc ttcctcaccg tgtacgagtg gcgtagcccc 1800 
tacggcctca cgccacgtgg ccgcaaccgc agcaccgtct tctcctactc ctcagccctc 1860 
aacctgtgct acgccatcct cttcagacgc accgtgtcca gcaagacgcc caagtgcccc 1920 
acgggccgcc tgctcatgaa cctctgggcc atcttctgcc tgctggtgct gtccagctac 1980 
acggccaacc tggctgccgt catggtcggg gacaagacct tcgaggagct gtcggggatc 2040 
cacgacccca agctgcacca cccggcgcag ggcttccgct tcggcaccgt gtgggagagc 2100 
agcgccgagg cgtacatcaa gaagagcttc cccgacatgc acgcacacat gcggcgccac 2160 
agcgcgccca ccacgccccg cggcgtcgcc atgctcacga gcgacccccc caagctcaac 2220 
gccttcatca tggacaagtc gctcctggac tacgaggtct ccatcgacgc cgactgcaaa 2280 
ctgctgaccg tgggaaagcc cttcgccatt gagggctatg ggatcggact gccccagaac 2340 
tcgccgctca cctccaacct gtccgagttc atcagccgct acaagtcctc cggcttcatc 2400 
gacctgctcc acgacaagtg gtacaagatg gtgccttgcg gcaagcgggt ctttgcggtt 2460 
acagagaccc tgcagatgag catctaccac ttcgcgggcc tcttcgtgtt- gctgtgcctg 2520 
ggcctgggca gcgctctgct cagctcgctg ggcgagcacg ccttcttccg cctggcgctg 2580 
ccgcgcatcc gcaaggggag caggctgcag tactggctgc acaccagcca gaaaatccac 2640 
cgcgccctca acacggagcc accagagggg tcgaaggagg agacggcaga ggcggagccc 2700
agcggccccg aggtggagca gcagcagcag cagcaggacc agccaacggc tccggagggc 2760 
tggaaacggg cgcgccgggc cgtggacaag gagcgccgcg tgcgcttcct gctggagccc 2820 
gccgtggttg tggcacccga agcggacgcg gaggcggagg ctgcgccgcg agagggcccc 2880 
gtctggctgt gctcctacgg ccgcccgccc gccgcaaggc ccacgggggc cccccagccc 2940 
ggggagctgc aggagctgga gcgccgcatc gaagtcgcgc gtgagcggct ccggcaggcc 3000
ctggtgcggc gcggccagct cctggcacag ctcggggaca gcgcacgtca ccggcctcgg 3060 
cgcttgcttc aggccagagc ggcccccgcg gaggccccac cacactctgg ccgaccgggg 3120 
agccaggaat ga 3132 

<210> 57 
<211> 1832 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4717525CB1 

<400> 57 

ggagctgacc ctgcggggtc ccgggggggg agggggagcc gcgaagcccc cactgagtgc 60 



cgccgctgcc ggggggggag ggggagccgc gaagccccca ctgaggccgc cgctgccggg 120 
cctcccctcc cccccgggcg ggcgccatgc gggggagccc gggcgacgcg gagcggcggc 180 
agcgctgggg tcgcctgttc gaggagctgg acagtaacaa ggatggccgc gtggacgtgc 240 
acgagttgcg ccaggggctg gccaggctgg gcgggggcaa cccagacccc ggcgcccaac 300 
agggtatctc ctctgagggt gatgctgacc cagatggcgg gctcgacctg gaggaatttt 360 
cccgctatct gcaggagcgg gaacagcgtc tgctgctcat gtttcacagt cttgaccgga 420 
accaggatgg tcacattgat gtctctgaga tccaacagag tttccgagct ctgggcattt 480 
ccatctcgct ggagcaggct gagaaaattt tgcacagcat ggaccgagac ggcacaatga 540 
ccattgactg gcaagaatgg cgcgaccact tcctgttgca ttcgctggaa aatgtggagg 600 
acgtgctgta tttctggaag cattccacgg tcctggacat tggcgagtgc ctgacagtgc 660 
cggacgagtt ctcaaagcaa gagaagctga cgggcatgtg gtggaagcag ctggtggccg 720 
gcgcagtggc aggtgccgtg tcacggacag gcacggcccc tctggaccgc ctcaaggtct 780 
tcatgcaggt ccatgcctca aagaccaacc ggctgaacat ccttgggggg cttcgaagca 840 
tggtccttga gggaggcatc cgctccctgt ggcgcggcaa tggtattaat gtactcaaga 900 
ttgcccccga gtcagctatc aagttcatgg cctatgaaca gatcaagagg gccatcctgg 960 
ggcagcagga gacactgcat gtgcaggagc gcttcgtggc tggctccctg gctggtgcca 1020 
cagcccaaac catcatttac cctatggaga ctctgaagaa ctggtggctt cagcagtaca 1080 
gccacgactc ggcagaccca ggcatcctcg tgctcctggc ctgcggtacc atatccagca 1140 
cctgcggcca gatagccagt tacccgctgg ccctggtccg gacccgcatg caggcacaag 1200 
cctccatcga gggtggcccc cagctgtcca tgctgggtct gctacgtcac atcctgtccc 1260 
aggagggcat gcggggcctc taccggggga tcgcccccaa cttcatgaag gttattccag 1320 
ctgtgagcat ctcctatgtg gtctacgaga acatgaagca ggccttaggg gtcacgtcca 1380
ggctcgagta cagtggctcg atttcagatc actgcaacct ctgccttcca ggttcaagcg 1440 
attctcctgc ctcagcctcc agagtagctg gtattacagg atttcatcat gttgcccagg 1500 
ctcatctcgg actcgtgggt tcaaggaatt cggcagcctt cagccttcca acgtgctggg 1560 
attacaggaa gccggtcgtc atgccatgag cagccttatg gagtggacca tgtggtaagg 1620 
aactcagcca atagccatgt aactgagctt tgaagaggat cttgctgtcc tggccaacat 1680 
ctcactgcga ttctatcagt tgaatttcct ggatagtcca agctttgttg gatccctgca 1740 
ccggtacagc tgggttccag taactgaatc ctgagtctta tgatgtttta cttcaagcat 1800 
gggtcagggg acagcgggtt atgggttcct tg 1832 

<210> 58 
<211> 1902 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 5091793CB1

<400> 58 

cccaccttca gtcctggctc tgacaagccc tccagcttca cgccacccgg gatgggagaa 60 
agcaggtgtc gcgagagttg ggcgcaagac gccttgtagg gagtgtaact atggccggcc 120 
tgcggaacga aagtgaacag gagccgctct taggcgacac acctggaagc agagaatggg 180 
acattttaga gactgaagag cattataaga gccgatggag atctattagg attttatatc 240 
ttactatgtt tctcagcagt gtagggtttt ctgtagtgat gatgtccata tggccatatc 300 
tccaaaagat tgatccgaca gctgatacaa gttttttggg ctgggttatt gcttcatata 360 
gtcttggcca aatggtagct tcacctatat ttggtttatg gtctaattat agaccaagaa 420 
aagagcctct tattgtctcc atcttgattt ccgtggcagc caactgcctc tatgcatatc 480 
tccacatccc agcttctcat aataaatact acatgctggt tgctcgtgga ttgttgggaa 540 
ttggagcagg aaatgtagca gttgttagat catatactgc tggtgctact tcccttcagg 600 
aaagaacaag ttccatggca aacataagca tgtgtcaagc attaggtttt attctaggtc 660 
cagtttttca gacttgtttt acattccttg gagaaaaagg tgtgacatgg gatgtgatta 720 
aactgcagat aaacatgtat acaacaccag ttttacttag cgccttcctg ggaattttaa 780 
atattattct gatccttgcc atactaagag aacatcgtgt ggatgactca ggaagacagt 840 
gtaaaagtat taattttgaa gaagcaagta cagatgaagc tcaggttccc caaggaaata 900 
ttgaccaggt tgctgttgtg gccatcaatg ttctgttttt tgtgactcta tttatctttg 960 
ccctttttga aaccatcatt actccattaa caatggatat gtatgcctgg actcaagaac 1020 
aagctgtgtt atataatggc ataatacttg ctgctcttgg ggttgaagcc gttgttattt 1080 
tcttaggagt taagttgctt tccaaaaaga ttggcgagcg tgctattcta ctgggaggac 1140 
tcatcgttgt atgggttggc ttctttatct tgttaccttg gggaaatcaa tttcccaaaa 1200 
tacagtggga agatttgcac aataattcaa tccctaatac cacatttggg gaaattatta 1260 
ttggtctttg gaagtctcca atggaagatg acaatgaaag accaactggt tgctcgattg 1320 
aacaagcctg gtgcctctac accccggtga ttcatctggc ccagttcctt acatcagctg 1380
tgctaatagg attaggctat ccagtctgca atcttatgtc ctatactcta tattcaaaaa 1440
ttctaggacc aaaacctcag ggtgtataca tgggctggtt aacagcatct ggaagtggag 1500
cccggattct tgggcctatg ttcatcagcc aagtgtatgc tcactgggga ccacgatggg 1560
cattcagcct ggtgtgtgga ataatagtgc tcaccatcac cctcctggga gtggtttaca 1620
aaagactcat tgctctttct gtaagatatg ggaggattca ggaataaact agctaagact 1680
gtgatggaaa ctacttgctg tgtggcactt cctggtctaa agctctgcta gacaattgcg 1740
gttgggtgca gtgtctcatg cctgtaatcc cagcactttg gaaggccaag gcaggaggat 1800
tgcttgaggc caggaattca agaccagcct gggaaacaaa gtgagaccct gtctttacaa 1860
aaaataaaaa caaaaattta aaactgtaaa aaaaaaaaaa aa 1902

<210> 59 
<211> 2820
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature
<223> Incyte ID No: $945527CB1

<400> 59 

gcgcccagct ctgtagcctc ctccgtcgac tcagccttag gtaccggtca ggcaaaatgc 60 
ggtcctccct ggctccggga gtctggttct tccgggcctt gtccagggac agctggttcc 120
gaggcctcat cctgctgctg accttcctaa tttacgcctg ctatcacatg tccaggaagc 180 
ctatcagtat cgtcaagagc cgtctgcacc agaactgctc ggagcagatc aaacccatca 240 
atgatactca cagtctcaat gacaccatgt ggtgcagctg ggccccattt gacaaggaca 300 
actataagga gttactaggg ggcgtggaca acgccttcct catcgcctat gccatcggca 360 
tgttcatcag tggggttttt ggggagcggc ttccgctccg ttactacctc tcagctggaa 420 
tgctgctcag tggccttttc acctcgctct ttggcctggg atatttctgg aacatccacg 480 
agctctggta ctttgtggtc atccaggtct gtaatggact cgtccagacc acaggctggc 540 
cctctgtggt gacctgtgtt ggcaactggt tcgggaaggg gaagcggggg ttcatcatgg 600 
gcatctggaa ttcccacaca tctgtgggca acatcctggg ctccctgatc gccgggatct 660 
gggtgaacgg gcagtggggc ctgtcgttca tcgtgcctgg catcattact gccgtcatgg 720 
gcgtcatcac cttcctcttc ctcatcgaac acccagaaga tgtggactgc gcccctcctc 780 
agcaccacgg tgagccagct gagaaccagg acaaccctga ggaccctggg aacagtccct 840 
gctctatcag ggagagcggc cttgagactg tggccaaatg ctccaagggg ccatgcgaag 900 
agcctgctgc catcagcttc tttggggcgc tccggatccc aggcgtggtc gagttctctc 960 
tgtgtctgct gcttgccaag ctggtcagtt acaccttcct ctactggctg cccctctaca 1020 
tcgccaatgt ggctcacttt agtgccaagg aggctgggga cctgtctaca ctcttcgatg 1080 
ttggtggcat cataggcggc atcgtggcag ggctcgtctc tgactacacc aatggcaggg 1140 
ccaccacttg ctgtgtcatg ctcatcttgg ctgcccccat gatgttcctg tacaactaca 1200
ttggccagga cgggattgcc agctccatag tgatgctgat catctgtggg ggcctggtca 1260 
atggcccata cgcgctcatc accactgctg tctctgctga tctggggact cacaagagcc 1320 
tgaagggcaa cgccaaagcc ctgtccacgg tcacggccat cattgacggc accggctcca 1380 
taggtgcggc tctggggcct ctgctggctg ggctcatctc ccccacgggc tggaacaatg 1440
tcttctacat gctcatctct gccgacgtcc tagcctgctt gctcctttgc cggttagtat 1500 
acaaagagat cttggcctgg aaggtgtccc tgagcagagg cagcgggtat aaagaaatat 1560 
gaggccccaa ttggaacagc agcatggagg gtcccagttg ggtccccaac gtgctcccca 1620
tgggcaagac aatggaaact tccacaagca ggggaggcaa accctcttta ttgaacatta 1680 
gccagcccag cccagacccc agggctgcct aaggacacag agattctcca tgggaagggg 1740 
actgccaagc atgaggaaat agaagattca ggggcctgag ctctggaagc tgcaagcaaa 1800 
agggatggga ctagggctga gttgtgtctc cattttgata aggaaaggat atgctcacac 1860 
tcttgcttgt tcagattcca agacagaagg cttcacaagg ccaacgcctg gaaaatgggc 1920 
atctdtccct cccatgttaa gctttaacct ctgtaatctg cctgtatcta taggtgggca 1980 
tctcactcca tcaaaggagc ccagcctctc tttgtccctc taccatgcaa cagtcttctc 2040 
tgtgcatttc cccaagctgg gccctcttct actctccatt taggcctgtt gataactcca 2100 
ttacccaccc atcactgctg ttcctccagg gccagcactc gggcgaggca ggggagctgc 2160
cttcggtaca taatttgaag gggcactccc tcttgggcac atgccggccc tgagtgcctc 2220 
ccttgcctca ctctgatccc ggccccataa tgtcctcagt ggaaggtgac tgggggccgg 2280 
tgctgtgggg agagtagaaa gaggggttgg catgactaaa aataccagta tgtgtattaa 2340 
gtattttgag aatgaaatgc caaggagtgc ctactgtatg ccagctctgc tctaggaatg 2400 
gagtagacag tggacacaag aaggacttac gccctgagca caggtgccaa cggtgacaag 2460 
actggcaaga cgtgagggca tgaatggttc attcaggcag ctgctgcagg tgtggtcacc 2520 
tggtsfccatc tgctgctccc ttttccactt ttctatgtcc tccttccacc ccaagtcccg 2580
gatcactcgc tgttttctgg ctagctcttg gcatctccat ctgagcctaa agttgcccac 2640 
tggcaccaat agattctgtt tgacctgctg tgcccatgct catctttgtc ggggcttggc 2700 
ttggcagcga gtgaggctgg catggggtgc tgaagtgggg tctcagtgag gggtgacagc 2760 
gtgctggcag tcctcacagc cctcgctcgc tctcggcgcc tcctctgcct gggctcccac 2820 

<210> 60 

<211> 3920 

<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 6941124CB1 

<400> 60 

ccggccgtga gtgcgcgagt gtgccatggc cgcggccacc aggcgcgtgt tccatctgca 60 
gccgtgcggt gggtgcctga acccgggggc gcggcgggaa ggccgcccgg gtctccgcca 120 
ccgtctcccg tcggggaccc cggggagaac gcggatgccg tctgcctgcg cgggccgggg 180
ccggtgggcg agagggctgc gggattccca gccgcgggga ggagggcgcg ggtgcgggag 240 
gcccgggcgc ggcttcgcgg ccaccgcgcg cttggtttca ttagggctct gctgtcgccg 300
tcttgaattt cttaccaggt ttgaacagct gatgcagcgg gtcgagggcg gggagctcga 360 
aagtgggcag ggcggagctg tggggctccc agcgccccga gcgccgagct tttgggagcc 420 
tggcgggccg ccgcctcgga aggtctcatt ttaccttccg cccgggcatc ttcccgtccg 480 
gcgccgcggt gggagccccg gaacccctgg agttttggga gtcccgggac ccctgaccgt 540
gccctgcgcg gccgcagcgg tttctccctt gctctgacac gggagggcca gccctgggtc 600 
cccgccagga cccaggtttg tacgaccctt tacctgcacg tgggccgccg ggacggaagc 660 
ggagtgtgca ccggcccaac cctcccagac cctcgtcgcc cacggcctgg gacgcgcccc 720 
agcgccccct agcagatggg ctaagcaatt gaatttctga gattaaggct ggcttcccct 780 
gctatggggc cccctcccga gcactgtccc ctgcttccct ttctccctag ggtggcgtgg 840 
gttgcatccc tcggaagagc gaaggaatga gccaggtcgg ggggcgggga gacaggtgca 900 
cacaggaggt ccagggcttg gtccatgggg ctggtgacct ttctgcttcc cttgcagaaa 960 
actctcccac catgtcgcag aatggatact tcgaggattc aagctactac aagtgtgaca 1020 
cagatgacac cttcgaagcc cgagaggaga tcctggggga tgaggccttc gacactgcca 1080 
actcctccat cgtgtctggc gagagtatcc gtttcttgtc aatgtcaacc ttgagatgca 1140 
ggcccacaac actgagaatg aagcgacttc cggtggctgt gtgctcctgc acacctcccg 1200 
aaagtacctg aagttaaaga acttcaagga agagatccgt gcgcaccgcg acctagatgg 1260 
cttcctggcg caggccagca tcgtcctgaa cgagacggcc acctccctgg ataacgtgct 1320 
gcggaccatg cttcgccgct tcgcccggga ccctgacaac aatgagccca actgcaacct 1380 
ggacctgctc atggccatgc tcttcaccga tgccggggca cccatgcggg gtaaagtcca 1440 
cctgctgtca gataccatcc aaggggtcac cgccacagtg acaggggtgc ggtaccagca 1500
gtcgtggctc tgcatcatct gtaccatgaa ggccctacag aagcggcacg tgtgcatcag 1560 
ccgcctggtt cgcccacaga actgggggga gaattcctgt gaggttcggt tcgtcatcct 1620 
ggtgctggcg ccacccaaga tgaaaagcac taagactgcg atggaggtgg cgcgcacgtt 1680
tgccaccatg ttctcggata tcgccttccg ccagaagctc ctggagaccc gcacagagga 1740 
ggaattcaag gaggccttgg tgcatcagag acagctgctc accatggtga gccacggtcc 1800 
agtggcgccg agaacgaagg aacgcagcac agtctccctc cctgcccaca gacacccaga 1860 
gcccccaaag tgcaaggact ttgtcccttt tgggaagggc atccgggagg acatcgcacg 1920 
caggttcccc ttgtacccct tggacttcac tgatggcatt attgggaaaa acaaggctgt 1980 
gggcaaatac atcaccacca ccctgttcct ctacttcgcc tgcctcctgc ccaccatcgc 2040 
tttcgggtct ctcaatgacg agaacacaga cggggccatc gacgtgcaga agaccatagc 2100 
cgggcagagc atcgggggcc tgctctacgc gctcttctct gggcagccat tggtgattct 2160 
gctgaccacc gcgcccctgg cgctctacat ccaggtgatt cgtgccatct gtgatgacta 2220
tgacctggac ttcaactcct tctacgcatg gacgggcctg tggaatagtt tcttccttgc 2280 
gctttatgcc tttttcaacc tcagcctggt catgagtctc ttcaagaggt cgacggagga 2340 
gatcatcgcc ctcttcattt ccatcacgtt tgtgctggat gccgtcaagg gcacggttaa 2400
aatcttctgg aagtactact atgggcatta cttggacgac tatcacacaa aaaggacttc 2460 
atcccttgtc agcctgtcag gcctcggcgc cagcctcaac gccagcctcc acactgccct 2520 
caatgccagc ttcctcgcca gccccacgga gctgccctcg gccacacact caggccaggc 2580 
gaccgccgtg ctcagcctcc tcatcatgct gggcacgctc tggctgggct acaccctcta 2640
ccaattcaag aagagcccct acctgcaccc ctgcgtgcga gagatcctgt ccgactgcgc 2700 
cctgcccatc gcggtgctcg ccttctccct catcagctcc catggcttcc gggaaatcga 2760
gatgagcaag ttccgctaca accccagcga gagccccttt gcgatggcgc agatccagtc 2820 
gctgtccctg agggccgtca gcggtgccat gggcctcggc ttcctgctgt ccatgctctt 2880
cttcatcgag cagaacttgg tggccgcctt ggtgaatgca ccggagaaca ggctggtgaa 2940
gggcactgcc taccactggg acctcctgct cctcgccatc atcaacacag ggctgtctct 3000
gtctgggctg ccttggatcc atgccgccta cccccactcc ccgctgcacg tgcgagccct 3060
ggccttagtg gaggagcgtg tggagaacgg acacatctat gacacgattg tgaacgtgaa 3120
ggagacgcgg ctgacctcgc tgggcgccag cgtcctggtg ggcctgtccc tgttgctgct 3180
gccggtcccg cttcagtgga tccccaagcc cgtgctctat ggcctcttcc tctacatcgc 3240
gctcacctcc ctcgatggca accagctcgt ccagcgcgtg gccctgctgc tcaaggagca 3300
gactgcgtac cccccgacac actacatccg gagggtgccc cagaggaaga tccactactt 3360
cacgggcctg caggtgcttc agctgctgct gctgtgtgcc ttcggcatga gctccctgcc 3420
ctacatgaag atgatctttc ccctcatcat gatcgccatg atccccatcc gctatatcct 3480
gctgccccga atcattgaag ccaagtactt ggatgtcatg gacgctgagc acaggccttg 3540
actggcagac cctgcccacg ccccattcgc cagccctcca cgtcctccca ggctggctct 3600
ggagctgtga ggggaggtgt aggtgtgtgg gtgactgctc tgtgctgcgc cttctcatgg 3660
ctgactcagg cctggggcat ctgggcattg taggggtgca gtggtatgtg cccacccctc 3720
tgccattatc ctttagcttt aggccaagag cgttgctcag ggcagcttcc gcccagggtg 3780
ggtgggactg agcaggatgg attttctttt gataaaagag tcgatgcctg aaagagaaac 3840
catttccttg attgtgtaag gaacttgctg gacgcacatt agagaataaa gctcctgttt 3900
ctaggctcct aaaaaaaaaa 3920

<210> 61 
<211> 1333 
<212> DNA

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 6972530CB1 

<400> 61 

tggggcaacc tgctaacagc cgctgttgtg gctttgctgt atggcggact ggtcgtgggc 60 
ttccctccgc ggcagggcgt ttccttcgaa gggcatctgt ttggtgcgct cgccggattt 120
attgcagcca aggtacttct atccacgccg cgcagcgggt ttaatgccgg gtgaacgcac 180 
ttgagcatcc agacgccgag gcgatgctgt cgtcaggatt caagtaccca gggagccagg 240 
ggctaaatgt cgattttcct actgctacgc atgtacgcct cgtccctctt tcatcgcttc 300 
ggttgggcag gcctgatcgt tgccttgggt gtccacctga gcaccgctta cctgggcctg 360 
gtgttgctgg gcgaacaaca cctcaccgct gccgccacgt ttatgtactt ctatctcacc 420
accacactca ccgtcggcta tggcgacctc gcgccgcaga cgtctgctgg gcgaattttc 480 
gtggcggcct gggtcatgct cgggggcatt gcgctgctca cggcagccat cggcaaaacc 540 
accagcagtg ttatcgatgc atggagaaaa ggcatgaagg gcaaaggtga tttcaccggc 600 
aaggtcggcc ataccgtgtt gatcggctgg gagggcgcgt ccagcgagcg ggtgattgag 660 
ttgttgctgc aagacgaaac ctcgaacgac aacctgatcg tgatctgcga ttgcagcctt 720 
gaagaaaatc cgatgccggg caaagcagcg ttcatccgcg gcgaaagcct gtcttccacc 780 
gcactgttgc tgcgtgccgg tgtgcccggt gccgagcgtg tgctggtgcg aaccccgtct 840 
gatgacctga cgctggccac cgtgctggcg gtgaaccaac tgagtcccgt cgggcatgtg 900 
gtcgcccact tcaacgaaag tgaaatcgcc gcactcgcca gctcctacgc cccgcgcctg 960 
gagtgcacgt ccagcatggc aattgaaatg ctggtgcgcg cctctcagga tccgggctcg 1020 
tcggtggtca tcaatgaatt gctgtgcgtg gggcagggcg ccacccagta cctgatgaaa 1080 
ttgcccgaag cctttgaggc aacattcggc gagctgtaca cgcagatgaa ggagcgccac 1140 
aacgccaccc tcattggcta tcgcgccaaa ggtgttcagc agccgtcgat caacccgccc 1200 
agtgccacag aggtcaaagg cggcgaactc ttctacatcg cttccactcg ccttaaggaa 1260 
atctcccatg ggatggctta aacagttgat ggggcttgag gccccaaaag ccactaccga 1320 
ctcgagccga acc 1333 

<210> 62 
<211> 8487 
<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature 

<223> Incyte ID No: 6991750CB1 
<400> 62

atgcacctat taataattta cattcaattt cagattgcag aggagggatc aaccatttct 60
tgtgtggttg agagaaccag aggggctctg gattatgtgc atgtttttta caccatttca 120
cagattgaaa ctgatggcat taattacctt gttgatgact ttgctaatgc cagtggaact 180
attacattcc ttccttggca gagatcagag gttctgaata tatatgttct tgatgatgat 240
attcctgaac ttaatgagta tttccgtgtg acattggttt ctgcaattcc tggagatggg 300
aagctaggct caactcctac cagtggtgca agcatagatc ctgaaaagga aacgactgat 360
atcaccatca aagctagtga tcatccatat ggcttgctgc agttctccac agggctgcct 420
cctcagccta aggacgcaat gaccctgcct gcaagcagcg ttccacatat cactgtggag 480
gaggaagatg gagaaatcag gttattggtc atccgtgcac agggacttct gggaagggtg 540
actgcggaat ttagaacagt gtccttgaca gcattcagtc ctgaggatta ccagaatgtt 600
gctggcacat tagaatttca accaggagaa agatataaat acattttcat aaacatcact 660
gataattcta ttcctgaact ggaaaaatct tttaaagttg agttgttaaa cttggaagga 720
ggagccagtc taggagtggc ttcccaaatt ctagtgacaa ttgcagcctc tgaccacgct 780
catggcgtat ttgaatttag ccctgagtca ctctttgtca gtggaactga accagaagat 840
gggtatagca ctgttacatt aaatgttata agacatcatg gaactctgtc tccagtgact 900
ttgcattgga acatagactc tgatcctgat ggtgatctcg ccttcacctc tggcaacatc 960
acatttgaga ttgggcagac gagcgccaat atcactgtgg agatattgcc tgacgaagac 1020
ccagaactgg ataaggcatt ctctgtgtca gtcctcagtg tttccagtgg ttctttggga 1080
gctcatatta atgccacgtt aacagttttg gctagtgatg atccatatgg gatattcatt 1140
ttttctgaga aaaacagacc tgttaaagtt gaggaagcaa cccagaacat cacactatca 1200
ataataaggt tgaaaggcct catgggaaaa gtccttgtct catatgcaac actagatgat 1260
atggaaaaac caccttattt tccacctaat ttagcgagag caactcaagg aagagactat 1320
ataccagctt ctggatttgc tctttttgga gctaatcaga gtgaggcaac aatagctatt 1380
tcaattttgg atgatgatga gccagaaagg tccgaatctg tctttatcga actactcaac 1440
tctactttag tagcgaaagt acagagtcgt tcaattccaa attctccacg tcttgggcct 1500
aaggtagaaa ctattgcgca actaattatc attgccaatg atgatgcatt tggaactctt 1560
cagctctcag caccaattgt ccgagtggca gaaaatcatg ttggacccat tatcaatgtg 1620
actagaacag gaggagcatt tgcagatgtc tctgtgaagt ttaaagctgt gccaataact 1680
gcaatagctg gtgaagatta tagtatagct tcatcagatg tggtcttgct agaaggggaa 1740
accagtaaag ccgtgccaat atatgtcatt aatgatatct atcctgaact ggaagaatct 1800
tttcttgtgc aactgatgaa tgaaacaaca ggaggagcca gactaggggc tttaacagag 1860
gcagtcatta ttattgaggc ctctgatgac ccctatggat tatttggttt tcagattact 1920
aaacttattg tagaggaacc tgagtttaac tcagtgaagg taaacctgcc aataattcga 1980
aattctggga cactcggcaa tgttactgtt cagtgggttg ccaccattaa tggacagctt 2040
gctactggcg acctgcgagt tgtctcaggt aatgtgacct ttgcccctgg ggaaaccatt 2100
caaaccttgt tgttagaggt cctggctgac gacgttccgg agattgaaga ggttatccaa 2160
gtgcaactaa ctgatgcctc tggtggaggt actattgggt tagatcgaat tgcaaatatt 2220
attattcctg ccaatgatga tccttatggt acagtagcct ttgctcagat ggtttatcgt 2280
gttcaagagc ctctggaaag aagttcctgt gctaatataa ctgtcaggcg aagcggaggg 2340
cactttggtc ggctgttgtt gttctacagt acttccgaca ttgatgtagt ggctctggca 2400
atggaggaag gtcaagattt actgtcctac tatgaatctc caattcaagg ggtgcctgac 2460
ccactttgga gaacttggat gaatgtctct gccgtggggg agcccctgta tacctgtgcc 2520
actttgtgcc ttaaggaaca agcttgctca gcgttttcat ttttcagtgc ttctgagggt 2580
ccccagtgtt tctggatgac atcatggatc agcccagctg tcaacaattc agacttctgg 2640
acctacagga aaaacatgac cagggtagca tctcttttta gtggtcaggc tgtggctggg 2700
agtgactatg agcctgtgac aaggcaatgg gccataatgc aggaaggtga tgaattcgca 2760
aatctcacag tgtctattct tcctgatgat ttcccagaga tggatgagag ttttctaatt 2820
tctctccttg aagttcacct catgaacatt tcagccagtt tgaaaaatca gccaaccata 2880
ggacagccaa atatttctac agttgtcata gcactaaatg gtgatgcctt tggagtgttt 2940
gtgatctaca atattagtcc caatacttcc gaagatggct tatttgttga agttcaggag 3000
cagccccaaa ccttggtgga gctgatgata cacaggacag ggggcagctt aggtcaagtg 3060
gcagtcgaat ggcgtgttgt tggtggaaca gctactgaag gtttagattt tataggtgct 3120
ggagagattc tgacctttgc tgaaggtgaa accaaaaaga cagtcatttt aaccatcttg 3180
gatgactctg aaccagagga tgacgaaagt atcatagtta gtttggtgta cactgaaggt 3240
ggaagtagaa ttttgccaag ctccgacact gttagagtga acattttggc caatgacaat 3300
gtggcaggaa ttgttagctt tcagacagct tccagatctg tcataggtca tgagggagaa 3360
attttacaat tccatgtgat aagaactttc cctggtcgag gaaatgttac tgttaactgg 3420
aaaattattg ggcaaaatca agaactcaat tttgctaact ttagcggaca acttttcttt 3480
cctgaggggt cgttgaatac aacattgttt gtgcatttgt tggatgacaa cattcctgag 3540
gagaaagaag tataccaagt cattctgtat gatgtcagga cacaaggagt tccaccagcc 3600
ggaatcgccc tgcttgatgc tcaaggatat gcagctgtcc tcacagtaga agccagtgat 3660
gaaccacatg gagttttaaa ttttgctctt tcatcaagat ttgtgttact acaagaggct 3720
aacataacaa ttcagctttt catcaacaga gaatttggat ctctaggagc tatcaatgtc 3780
acatatacca cggttcctgg aatgctgagt ctgaagaacc aaacagtagg aaacctagca 3840
gagccagaag ttgattttgt ccctatcatt ggctttctga ttttagaaga aggggaaaca 3900
gcagcagcca tcaacattac cattcttgag gatgatgtac cagagctaga agaatatttc 3960
ctggtgaatt taactcacgt tggacttacc atggctgctt caacttcatt tcctcccaga 4020
ctagattcag aaggtttgac tgcacaagtt attattgatg ccaatgatgg ggcccgaggt 4080
gtaattgaat ggcaacaaag caggtttgaa gtaaatgaaa cccatggaag tttaacattg 4140
gtagcccaga ggagcagaga acctcttggc catgtttcct tatttgtgta tgctcagaat 4200
ttggaagcac aagtggggct ggattatatc ttcaccccaa tgattcttca ttttgctgat 4260
ggagaaaggt ataaaaatgt caatatcatg attcttgatg atgacattcc agaaggagat 4320
gaaaaatttc agctgatttt aacaaatcct tctcctggac tagagctagg gaaaaataca 4380
atagccttaa ttattgtcct tgctaatgat gacggccctg gagttctatc atttaacaac 4440
agtgagcact ttttcctaag agagccaaca gctctctacg tccaggagag tgttgcagta 4500
ttgtacattg ttcgggaacc tgcacaagga ttgtttggaa cagtgacagt tcagttcatt 4560
gtgacagaag tgaattcctc aaatgaatct aaagatctga ctccttccaa aggctatatt 4620
gttttagaag aaggtgttcg attcaaggcc ctacaaatat ctgccatatt agacacggaa 4680
ccagaaatgg atgagtattt tgtttgcacc ttgtttaatc caactggagg tgctagacta 4740
ggggtgcatg ttcaaaccct gataacagtt ttgcaaaacc aggccccttt ggggctattc 4800
agtatctctg cagttgaaaa tagagccacc tccatagaca tcgaagaagc caataggacc 4860
gtgtatttaa atgtatctcg aactaatggc attgatttgg ctgatttaaa tatagaaaat 4920
ccgaaaactt gtgaggcctt taatattggt ttttctccct actttgtgat tactcatgaa 4980
gaaagaaatg aagaaaagcc ttctcttaac agtgtgttta cattcacatc tggatttaaa 5040
ttattcctgg tacaaacaat cattattctg gaaagttctc aagtaagata ttttacttca 5100
gacagccaag attatttaat cattgcaagt caaagagatg attccgaatt aactcaggtc 5160
ttcaggtgga atggaggaag cttcgtgttg catcaaaaac tccctgtccg aggtgtgctg 5220
accgtggcct tgttcaacaa gggaggctct gtgttcttag ccatttccca ggctaatgcc 5280
aggctaaact cccttttatt cagatggtct ggcagtgggt ttattaactt tcaagaggtg 5340
cctgtcagtg ggacaacaga agttgaggct ttgtcttcag ccaatgatat ttacctaata 5400
tttgccaaaa atgtctttct aggagatcag aattcaattg atattttcat ctgggagatg 5460
ggacagtctt ccttcaggta ttttcagtct gtagattttg ctgctgttaa cagaatccac 5520
tccttcacac cagcctcagg aatagcccac atacttctta ttggccaaga tatgtctgct 5580
ctttactgct ggaattcgga gcgtaatcaa ttctcttttg ttctggaagt accttctgct 5640
tatgatgtgg cttctgttac agtaaagtcc cttaattcaa gcaagaattt aatagctcta 5700
gtgggagctc attcacatat atatgagcta gcctacattt ccagccattc tgactttatt 5760
cctagttcag gtgaactgat atttgaacct ggtgagagag aagctacaat agcagtaaat 5820
atccttgatg atacagttcc agaaaaagaa gaatccttca aagttcaact taaaaatccc 5880
aaaggaggag cagagattgg cattaatgat tctgtaacaa taaccattct gtctaatgat 5940
gatgcctatg gaattgttgc atttgctcag aattcattat ataagcaagt ggaagaaatg 6000
gagcaagata gcctagtaac cttgaacgtt gaacgcttaa aaggaacata tggccgtata 6060
accatagcat gggaagctga tggaagtatt agtgatatat ttcctacctc aggagtggtg 6120
gaaaaacgaa tgtcagcaaa gattttattt actgaaggcc aggtactgtc aacaatcact 6180
ctaactattc ttgctgataa tataccagag ttatcagagg ttgtgattgt aaccctcacc 6240
cgtatcacca cagaaggggt tgaggactca tacaaaggtg ctactattga tcaggacaga 6300
agcaagtctg ttataacaac tttgcccaat gactcacctt ttggcttggt gggctggcgt 6360
gctgcgtctg tcttcattag agtagcagag cctaaagaaa acaccaccac tcttcagtta 6420
caaatagctc gagataaagg actacttggg gatattgcca ttcacttgag agctcaaccc 6480
aacttcttac tgcatgtcga taatcaagct actgagaacg aagattatgt attgcaagaa 6540
acaataataa taatgaaaga aaacataaaa gaagctcatg ccgaagtttc cattttgccg 6600
gatgaccttc ctgaattgga ggaaggattt attgtcacta tcactgaggt gaacctggtg 6660
aactctgact tctctacagg acagccaagt gtgcggaggc ccggaatgga aatagctgag 6720
ataatgatag aagaaaatga cgatcccaga ggaattttta tgtttcatgt tactagaggc 6780
gctggggaag ttattactgc ctatgaggtg cctccaccct tgaacgttct tcaagttcct 6840
gtagtccggc tggctggaag ctttggggca gtaaatgttt attggaaagc atcaccagac 6900
agtgctggcc tggaagactt taaaccatct catgggattc ttgaatttgc agataaacag 6960
gttactgcaa tgatagaaat caccataatt gatgatgctg aatttgaatt gacagagacg 7020
ttcaatattt ccttgatcag tgttgctgga ggtggcagac ttggtgatga tgttgtggta 7080
actgttgtta ttccacaaaa tgattctcca tttggagtat ttggatttga agaaaagact 7140
gtaatgattg atgaatccct ttcatccgat gaccctgatt catatgtgac attgacggtt 7200
gtccggtccc caggaggaaa aggaaccgtc cgacttgagt ggaccataga tgagaaggct 7260
aaacataacc ttagtccttt gaatgggacc cttcattttg atgagactga gtcccagaag 7320
accattgtgt tgcacacact tcaagacaca gtgttggagg aggacaggcg tttcaccatt 7380
cagctgatat caattgatga ggtagaaata tctccagtaa aaggtagtgc atcaataatt 7440
attcggggtg ataagcgagc atcaggagaa gttgggatag ctccgtcatc taggcacatc 7500
ctcattgggg aaccctcagc aaaatataat ggtaccgcta ttatcagcct tgttcgaggc 7560
ccagggattt tgggggaggt cacagtgttc tggaggatat tccctccttc cgtgggggaa 7620
tttgctgaaa catcaggaaa actgacaatg cgagacgaac agtctgcagt cattgtagta 7680
atacaggctt tgaacgatga cattcccgag gaaaaaagct tctatgagtt tcagctcact 7740
gcagtcagtg agggaggagt tctgagtgaa tccagcagca ctgccaacat cacggtggtg 7800
gccagcgact ctccctatgg ccgatttgcc ttttcacatg agcaacttcg agtgtcagaa 7860
gcacagaggg ttaacatcac aatcatccgt tccagtggag attttggcca tgtgcgactc 7920
tggtacaaga cgatgagcgg gacagcggaa gcaggcttgg attttgttcc tgcagcaggg 7980
gagctcctct ttgaagcagg ggagatgagg aaaagtctgc atgttgaaat ccttgatgat 8040
gactatcctg aaggcccaga ggaattttct ctaacaatta caaaggtgga actccaggga 8100
aggtaaagga gaaaggcaat taggaaaaag aaagcaaaga gcagagagaa agagagaaag 8160
ttctattgtg tagagattaa tactgagctc taaagcagtt ctttccctac tctttctcag 8220
atttgtgtga cattagaaca ggacatacaa gggaaattat gacaatgact gagttttttt 8280
ttttaatttt gttaaatata acttccctct gaaaattgaa agtacgtgtt tctattgtgt 8340
ggacacactc tgggtagcca ggaagttgaa tatgataacg gcttctaatt ggtgaagaca 8400
cagcctaaca gctctctaaa tcagaggtca gctggtgtgg ctgaatttgc ttcttttgat 8460
ttccatactt gtgaaagggt gaaggga 8487

<210> 63
<211> 3264
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 71726948CB1

<400> 63

cgctggaagc agctggaggc cccagttctc atctgctcag gtgtccccgg cgccttggcg 60
aactcggcca ctccagttcc tcacgtggtg agcactcagg gcagcgggtc gattttccga 120
ggtcccatac ctgggtttga ggggcgcggc tcgcagcggc gggtgcaggg gcgactgcca 180
gccctcaccc cgcctcgggg tgcgttcgga ggccgacacc tggaggacgc ctccagtccc 240
cgcgggacgc cacgcctgcg cgccagggat ccgggataag aagtgcgcgc cgggctccgg 300
ctgcgcgccg cggggccacc agtttgcgcg cagggctcag gcgaccgtgc ggccatggac 360
acgccacggg gcatcggcac cttcgtggtg tgggactacg tggtgttcgc gggcatgctg 420
gtcatctcgg ccgccatcgg catctactac gccttcgctg ggggcggcca gcagacctcc 480
aaggacttcc tgatgggcgg ccgcagaatg accgcagtgc ccgtggcgct gtccctcacc 540
gctagcttca tgtcagccgt cactgtcctg ggcaccccct ccgaggtcta ccgttttggg 600
gccattttta gcatctttgc cttcacctac ttctttgtgg tggtcatcag cgcggaggtc 660
ttcctcccgg tgttctacaa actgggaatt accagcacct acgagtattt agaacttcga 720
tttaacaaat gtgttcgtct ctgtggaaca gtcctcttca ttgttcaaac aattctgtat 780
actggaattg ttatttatgc ccctgccctg gctttgaatc aagtcacagg atttgatctg 840
tggggcgcgg tagtggcaac gggggtggtc tgcacattct actgcacact gggtggtctt 900
aaagcagtta tctggacaga tgtttttcaa gttgggatca tggtggctgg atttgcatcc 960
gtgattatac aggctgtggt gatgcaaggt ggaatcagca ctattttaaa tgatgcctat 1020
gatggtggaa gattaaattt ctggaatttt aatcctaacc ctttgcaaag acacaccttc 1080
tggacaatta ttataggagg gaccttcaca tggaccagca tctacggtgt caaccaatcc 1140
caggtgcaga gatatatttc ttgtaaaagc agattccagg caaaactgtc tctctacatc 1200
aatcttgtgg gactctgggc aatcctcaca tgctcagtgt tttgtgggct cgccctatat 1260
tccaggtacc atgactgtga tccttggaca gccaagaaag tgtctgcacc agaccagctc 1320
atgccttatt tggtactgga cattctgcaa gattatccag gacttcctgg actttttgtg 1380
gcctgtgctt acagtgggac attaagcaca gtgtcctcca gtattaatgc cttagcagca 1440
gtaactgtgg aagatctaat caaaccttac ttcagatcgc tctcagaaag gtctctgtct 1500
tggatttccc aaggaatgag tgtggtgtat ggagccctgt gtattggaat ggctgcgctg 1560
gcgtcactta tgggagcttt gttgcaggca gcactcagcg tatttggtat ggttggtgga 1620
ccacttatgg gcctgttcgc tttgggcatt ttggttccct ttgccaactc aattggagca 1680
cttgttggtc tgatggctgg atttgccatt tctctatggg ttggaattgg agctcaaata 1740
tatcctccac ttcctgagag aacattgcca ttgcaccttg atatccaagg ctgtaacagc 1800
acctacaatg agacaaattt gatgacaacc acagaaatgc catttactac tagtgttttt 1860
caaatataca atgttcaaag gactccactg atggacaact ggtattcttt atcatatctg 1920
tacttcagca ctgttggaac tttggtaaca ttattagtgg ggatacttgt cagtttatca 1980
acaggaggaa gaaaacagaa cttagacccc agatatatac taaccaaaga ggacttttta 2040
tccaatcttg atacttttaa gaaaaagaag catgttttga gctataaatc acatccagtg 2100